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Preface 



Simulation plays a major role in computer aided design of integrated circuits (ICs). 
Mathematical models describe the dynamical processes and interactions of elec- 
trical devices. Verification of a circuit's behavior by means of solving these model 
equations in time and frequency domain is a mandatory task in the design process. 
The structures' sizes are decreasing, the packing density increases and so do the 
driving frequencies. This requires to use refined models and to take into account 
secondary, parasitic effects. The very high dimensional problems that emerge in 
this way may be solvable with the help of computer algebra in an unreasonable 
amount of time only. Clearly, this conflicts with the short time-to-market demands 
in industry. 

Model order reduction (MOR) presents a way out of this dilemma. Redun- 
dancies are resolved, less relevant quantities are replaced by the most significant 
ones. In this way, the problem's complexity is reduced, keeping the main char- 
acteristics. Solving lower dimensional problems one can get statements on the 
circuit's performance more quickly. 

This book surveys the state of the art in the challenging research field of MOR 
for integrated circuits (ICs), and also addresses future research directions in this 
area. Special emphasis is put on aspects stemming from miniaturization to nano- 
scale. Contributions cover complexity reduction using e.g., balanced truncation, 
Pade-approximation/Krylov-techniques, or POD approaches. For semiconductor 
applications, a focus is on generalizing current techniques to differential algebraic 
equations, on including design parameters, on preserving stability and other 
physical properties as well as model structures, and on including nonlinearities 
by means of piecewise linearizations along solution trajectories (TPWL) and 
interpolation techniques for nonlinear parts. Furthermore, the influence of inter- 
connect and powergrids on the physical properties of the device is considered, and 
also top-down system design approaches in which detailed block descriptions are 
combined with behavioral models. MOR faces the same requests as those 
appearing in Response Surface Modeling (RSM) techniques for robust design, 
with emphasis on parameterization, parameter screening, and nonlinearity. Further 
topics consider MOR and the combination of approaches from optimization and 
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Statistics, and the inclusion of Partial Differential Equation (PDE) models with 
emphasis on MOR for the resulting partial differential algebraic systems. 

It is a proven fact that semiconductor industries see MOR as a key tool for 
simulations of nano-technology designs, with high demand of well educated 
experts in the field. This implies that timely education is needed. The current 
number of books in this area is very limited, so that this volume helps to fill a gap 
in providing the state of the art material, and to stimulate further research in the 
area of MOR for ICs. Furthermore, the methods which currently are being 
developed have also relevance in other application areas such as mechanical 
multibody systems, mechatronics, micro- and nano-technology, chemical and 
biological processes, etc. 

Furthermore, this book reflects and documents the vivid interaction between 
three active research projects in this area, namely the EU-Marie Curie Action ToK 
project O-MOORE-NICE' (members in Belgium, The Netherlands and Germany), 
the EU-Marie Curie Action RTN-project COMSON^ (members in The Nether- 
lands, Italy, Germany, and Romania), and the German federal project System 
Reduction for Nanoscale IC Design (SyreNe)r 

The material collected in this book reflects the contributions to the workshop 
"Model Reduction in Circuit Simulation", held at University of Hamburg, October 
30-31, 2008. The workshop was jointly organized by the abovementioned projects 
SyreNe and 0-Moore-Nice ! , supported by the Federal Ministry of Education and 
Research (BMBF) of Germany and the EU FP6 programme "Marie Curie Industry 
Host Fellowships", respectively. 

Part I of the book consists of overview papers of tutorial value corresponding to 
the invited presentations. Chapter 1 by Wilhelmus H. A. Schilders describes the 
need for novel MOR techniques in the electronics industry and thus provides a 
good motivation for the the technical papers to follow. In Chap. 2, Roland Freund 
derives the mathematical equations describing linear RCL circuits (which are the 
main tool for modeling interconnect and other parasitic effects in IC design) and 
presents Pade and Pade-type approximation methods respecting the structure of 
typical mathematical RCL models. A different approach to MOR is taken by 
Tatjana Stykel in Chap. 3 where the application of balancing-related methods from 
systems theory to circuit equations is surveyed. Special emphasis is put there on 
the exploitation of the topological structure implied by the electrical network. In 
Chap. 4, Sanda Lefteriu and Athanasios C. Antoulas pave the way to macro- 
modeling using measured data rather than starting from model equations. The 
main principle employed here is (tangential) interpolation using the Loewner 
matrix framework. 

In Part II, contributed papers are collected. Chapters 5 and 7 provide different 
MOR methods and techniques to serve parameterized problems. In Chap. 5, 



www.tu-chemnitz.de/mathematik/mdustrie_technik/projekte/omoorenice/. 
^ www.comson.org for details. 
See www.syrene.org for details. 
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surrogate models (or RSMs) based on table models and neural network models are 
built that are valid for a range of parameters. These are applied to solve multi- 
input, multi-output (MIMO) problems. In Chap. 7, parameterized response func- 
tions are using expansions into bivariate rational basis functions. The coefficients 
are determined by optimizing a special cost function. 

The Chaps. 6, 13 and 16 consider techniques to overcome problems associated 
with having massive numbers of inputs and outputs (or terminals, ports). In 
Chap. 6, recycling of Krylov sub spaces is applied to sequentially solve problems 
with changing right-hand sides occurring in an approach based on superposition of 
single input systems. Several recycling variants are studied for the Generalized 
Conjugate Residual method as basic linear solver. In Chap. 13, the sources are 
approximated by a finite expansion in basis functions. This allows for replacing 
current sources by a reduced number of controlled current sources. The network 
with the replaced sources remains passive and reciprocal. In Chap. 16, the situation 
where the large number of ports originates from validation problems that include 
many parasitics is considered. Extended SVD-based MOR (ESVDMOR) uses 
SVD on the input and output super matrices of the moments of the transfer 
function. A variant uses truncated SVD. Both approaches lead a system with 
virtual input and output operators and reduced number of terminals. For both 
approaches, stability, passivity, and reciprocity can be preserved under reasonable 
assumptions. 

The Chaps. 8 and 9 address two different problems with circuits involving 
inductors. In Chap. 8, the transfer function at a node that is a candidate to be 
eliminated within the TICER method is first checked for the contribution coming 
from an attached inductor element. When passivity is preserved, RC couplings 
replace the first order inductor effect. After this the TICER step can be completed. 
In Chap. 9, the phase shift effect in a periodic steady state solution of an oscillator 
is determined by defining appropriate perturbation vectors covering the effects 
from other parts of the circuit. This allows to efficiently solve a balun circuit. The 
simulation time is further reduced by applying MOR (balanced truncation) to the 
RC models for the transmission lines coupling the oscillators. 

In Chap. 10, POD-based model order reduction for semiconductors in electrical 
networks is investigated, where the semiconductor is modeled by the drift diffusion 
equations. The greedy approach is used to construct POD models which are valid 
over certain parameter ranges (frequencies). Numerical investigations for the 
reduction of a 4-diode rectifier network indicate that POD surrogate models 
depend on the position of the semiconductor in the network. 

Chapter 1 1 deals with periodic linear systems as they arise in harmonic balance. 
It extends the notion of causal and non causal reachability and observability 
Gramians to discrete sequences of periodic descriptor systems (for instance arising 
after linearising around a periodic steady state solution). Periodic projected 
Lyapunov equations can be defined as well as (non-)causal Hankel singular values. 
This framework allows to extend MOR by balanced truncation to this class of 
problems. 
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In Chap. 12, synthesis of reduced models is considered: how to formulate those 
as netlist using RLC components and to reduce the amount of controlled sources. 
Foster synthesis only applies to partial fraction expansion of the transfer function. 
It is shown how MOR techniques that preserve the main structure of the equations 
and the input-output connectivity, formulated for problems in impedance form 
(current in, voltage out), can also be applied to problems in admittance form 
(voltage in, current out). This allows studying a second method, unstamping via 
RLCSYN. After synthesizing the reduced impedance model, this can be converted 
to an admittance model. 

Structure preserving port-Hamiltonian reduction is introduced in Chap. 14 for 
electrical circuits. Formulation of a port-Hamiltonian state space system can easily 
guarantee stability. Kalman decomposition results in a controllable/observable 
port-Hamiltonian system. Balancing preserves the port-Hamiltonian structure of 
the equations. 

In Chap. 15, symbolic reduction and simplification of expressions in nonlinear 
circuit equations is combined with reduction of blocks by numerical MOR tech- 
niques. The method relies on a hierarchical structure of the circuit. 

In Chap. 17, MOR techniques like POD and Trajectory Piece-Wise Linear 
(TPWL) for nonlinear problems are considered, together with variants (like 
adapted POD, DEIM). The effect of the chosen variant on the quality of the 
approximations is considered as well as the efficiency and the robustness with 
respect to variations of the input sources. 

In Chap. 18 an approach to nonlinear balancing and MOR is considered. For 
linear systems the notion of sliding interval balancing (SIB) is introduced, and it is 
shown that subsystems conserve stability. The truncated SIB realization bounds 
the optimal approximation in an input-output sense. Nonlinear balancing is 
approached by applying SIB on the linear variational system. 

Hamburg, August 2010 Peter Benner 

Michael Hinze 
E. Jan W. ter Maten 
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Chapter 1 

The Need for Novel Model Order 
Reduction Techniques in the 
Electronics Industry 



Wil H. A. Schilders 



Abstract In this paper, we discuss the present and future needs of the electronics 
industry with regard to model order reduction. The industry has always been one 
of the main motivating fields for the development of MOR techniques, and con- 
tinues to play this role. We discuss the search for provably passive methods, as 
well as passivity enforcement methods that are currently being developed. 
Structure preservation is another important research topic, for which new concepts 
are being developed. This also holds for the calculation of delays in long inter- 
connect lines, a topic that leads to an entirely new type of methods. Topics that are 
still in their infancy are model order reduction for parameterized and nonlinear 
problems. We will discuss what the needs of the industry are in all of these fields, 
show specific applications and what has been achieved so far. The paper is meant 
as a guideline for future research, not as a detailed survey of existing methods. 



1.1 Introduction 

Both in the area of dynamical systems and control, and in numerical analysis, a 
wealth of model order reduction (MOR) techniques have been developed over the 
years. Balanced truncation, Krylov subspace methods, proper orthogonal decom- 
position and other SVD-based methods are just a few classes of methods that have 
been described. The electronics industry has been one of the main providers of 
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motivating examples, and indeed model order reduction has found widespread use 
in this industry. 

The starting point for MOR in the electronics industry is usually attributed to a 
method termed asymptotic waveform evaluation (AWE) [33]. The underlying idea 
of this method is simple, approximating the moments of the transfer function of the 
system. The idea being that moments will decay, so that calculating a sufficient but 
finite number of moments will eventually lead to an accurate approximation of the 
transfer function. Soon after its publication, it was realized that the method suffers 
from numerical problems that are similar to those encountered when applying the 
power method to calculate eigenvalues and eigenvectors of a system. The columns 
of the coefficient matrix generated to calculate the solution will tend more and 
more towards the eigenvector corresponding to the largest eigenvalue, which 
means that we will soon find this matrix to be very ill-conditioned. This is exactly 
what is observed when applying AWE: after 8-10 iterations, the process stagnates, 
and no additional useful information about moments is being generated. This 
stagnation is graphically illustrated in Fig. 1.1. 

In order to overcome these numerical difficulties associated with AWE, 
Feldmann and Freund [13] proposed the use of Krylov subspace type methods. 
Within the area of numerical linear algebra, such methods had been suggested in the 
1950s, and shown to be capable of avoiding the ill-conditioning of the coefficient 
matrices encountered in the power method. In fact, Lanczos already used a similar 
technique to determine eigenvalues and eigenvectors of symmetric matrices. Again, 
the idea is fairly simple: rather than generating a sequence of vectors that quickly 
become (almost) linearly dependent, after each iteration an orthogonalisation step is 
performed. The effect is that the same subspace is produced, but now with an 
orthonormal basis. Stated differently, information already contained in previous 
iterates is projected out of the newly generated vectors, and only the new information 
is taken into account. Not surprisingly, this also provided the solution to the problems 
arising in AWE. The method suggested by Feldmann and Freund was termed 
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Pade-via-Lanczos (PVL) [13], and shown to match the moments of the transfer 
function. It demonstrated that Krylov subspace methods, developed mainly in the 
area of numerical linear algebra, can also be used to perform model order reduction. 

With the advent of PVL, the way was paved for new developments in MOR. 
In the past decade, the electronics industry has continued to formulate challenging 
research topics in the field of model order reduction. Although PVL eliminated the 
problem of ill-conditioning of the coefficient matrix associated with the calcula- 
tion of the moments of the transfer function, it turned out to lead to non-passive 
approximations in some cases. Indeed, it is fair to expect that reduced order 
models for passive systems should also be passive. Thus, the invention of PVL was 
soon followed by the search for provably passive Krylov subspace methods. In 
1998, the first of such methods was published (PRIMA [32], based on the Arnoldi 
process), soon followed by the Laguerre-type methods [27, 30]. PVL itself was 
also remedied, although the SyPVL and SymPVL methods [15] do not seem to 
have found widespread use; this also holds for Laguerre-type methods. Other work 
that should be mentioned here is that by Antoulas and Sorensen on passivity 
preservation based on methods using spectral zeros, in which forward and adjoint 
systems are treated together [3]. 

Since the beginning of the new century, developments in MOR for problems in 
the electronics industry have been stepped up. Engineers formulated more and 
more constraints and requirements, thereby formulating many interesting topics for 
researchers in the field of model order reduction. An important question was, for 
example, how to assess the accuracy of methods. Or, equivalently, how can we 
guarantee that the reduced order model generated is a sufficiently accurate 
approximation of the original transfer function. For methods based on Krylov 
subspaces, this question has not been answered so far, so that the iterative process 
is usually stopped when the difference between subsequent iterations is below a 
certain threshold. Methods used in the area of dynamical systems and control, most 
prominently those based on Hankel singular values, are equipped with good and 
reliable error estimates. The counterside is that these methods involve the solution 
of Lyapunov equations, which is limited to fairly small systems. Hence, research is 
now concentrating on extending the applicability of these methods to larger sys- 
tems, often making use of the sparsity [7]. 

Another constraint that was formulated fairly recently is the preservation of the 
structure of systems. Again, this is a natural requirement. This has led to the 
development of classes of structure-preserving methods, of which SPRIM [17] and 
structure-preserving PMTBR [14] are examples. Although these methods have 
provided new ideas to perform MOR for electronic systems, new developments are 
still needed in this area. One of the problems is that the methods suggested so far 
destroy the structure of the incidence matrix that relates the different types of 
unknowns (such as currents and voltages), leading to a fully dense system of 
couplings. This also holds for the method described by Vandendorpe and Van 
Dooren [45] for interconnected systems, although, at first glance, this method 
appears to preserve the sparse interconnections. Also interesting in this context is 
the paper [16]. 
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Despite all efforts, many unsolved problems remain, and the electronics industry 
is anxiously waiting for solutions. In this paper, the demands of the electronics 
industry with respect to model order reduction will be detailed. In Sect. 1.2, we 
briefly sketch where these needs arise, and place the problem in its context. We start 
by discussing, in Sect. 1.3, the problems of passivity and realization, the latter 
meaning that the mathematically reduced order model should be cast into a phys- 
ically realizable model mimicking properties of the original model. In Sect. 1 .4 it 
will be shown that structure preservation is still not in a state that is acceptable to 
the business, and that different concepts and methods are needed. 

Another important problem is that of the reduction of networks with many 
inputs and outputs, the topic of Sect. 1.5 Krylov methods cannot handle such 
problems, and methods that make use of singular value decompositions are not 
applicable in most cases. We will also discuss methods for delay equations, a fairly 
recent development in MOR for long interconnect lines. Section 1.6 gives some 
details on the requirements associated with this problem. 

Last, but certainly not least, the state of nonlinear and parameterized model 
order reduction techniques is far from mature enough to help the industry cope 
with their problems in behavioral and response surface modeling. Section 1.7 
discusses some of the issues, and the state-oft-the-art in this important research 
area. Future developments will probably be concentrated in this area. Section 1.8 
is the conclusion of this paper, with a summary of the current and future needs of 
the electronics industry. 

It should be noted that the discussion in this paper will not be exhaustive; 
several other problems in the electronics industry lead to requirements for model 
order reduction techniques, and often these are the subject of research. Further- 
more, the main emphasis will be on Krylov subspace methods, whence some 
readers may feel that solutions can also be provided by methods originating from 
the area of dynamical systems and control theory. Besides not being exhaustive, 
the discussion will also not be elementary, in the sense that some background 
knowledge by the reader is required, both in the electronics industry and in the area 
of model order reduction. For the latter, this background information can conve- 
niently be obtained from the surveys found in the books [4, 8, 37, 43]. For the 
former, a wealth of books is available, as well as many websites. 



1.2 Mathematical Problems in the Electronics Industry 

The electronics industry is dealing with the design of computer chips. As is well 
known, the development over the years can be fairly well described by Moore's 
Law, as displayed in Fig. 1.2. It has various possible interpretations, but the 
bottom line is that every 1 8 months a new generation of chips is borne that is twice 
as fast as the previous generation. The dimensions are dramatically reduced from 
generation to generation, and the operating frequencies rise exponentially. 
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Fig. 1.3 Illustration of the scale problem in the electronics industry 



Dealing with computer chips means having to deal with complexity. Simulation 
of the behavior of semiconductor devices, electronic circuits and interconnect 
structures is a huge mathematical problem, in which we need to deal with scales 
that vary up to 10'\ This scale problem is illustrated in Fig 1.3. In the left picture, 
we see a close-up of the wiring in the interconnect structure, with details of 
100 nanometers. The middle picture shows a 50 GHz integrated inductor, for 
which the scale is in the order of 500 |J,m. The picture on the right shows a close- 
up of the Intel Itanium chip (source: McKinley, 2002) for which approximately 
10 km of interconnect are needed to connect all components. 

Fortunately, we see a development in numerical methods that is similar to 
Moore's law. Since the advent of iterative methods for solving linear systems, 
occurring at the core of all simulation programmes, the size of systems that can be 
solved has increased exponentially. Most notable developments are preconditioned 
conjugate gradient methods and the multigrid method. Figure 1.4 shows the 
development of these methods in a period of 30 years (see also [49]). 

When considering the mathematical problems in more detail, we should dis- 
tinguish various levels, as indicated by the scale issue discussed before. At the top 
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Fig. 1.4 "Moore's law" for 
numerical methods in solving 
linear systems 
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level, we need to deal with electronic circuit simulation which encompasses large 
(millions) discrete networks containing resistors, capacitors, inductors, diodes, 
transistors, and other components. The discretisation of this system is fairly simple, 
as it obeys Kirchhoff's laws. The models used for active devices, such as diodes and 
transistors, are extremely nonlinear, meaning that non-standard methods are needed 
to solve the associated nonlinear systems. The linear systems generated in this 
process often have a hierarchical structure and are sparse, but may also be indefi- 
nite. Transient simulations are also non-trivial, due to the sudden changes occurring 
due to externally applied signals. AC analysis for high frequencies requires the 
solution of large complex linear systems. Pole-zero analysis and harmonic balance 
are also worth mentioning, but there are some comments to be made here. Normal 
AC analysis has a matrix mC + G in which m — i ■ 2n f. Clearly, here the size of 
the system equals the size of C and G (i.e. defined by the design). In Harmonic 
Balance more harmonics are involved in one coupled system. So here the number of 
harmonics influences the size of the system. In nonlinear problems high frequency 
effects may result in the need to increase the number of harmonics. In Periodic AC 
(PAC), i.e. studying small perturbations after linearizing around a PSS solution, two 
frequencies have to be taken into account: that of the PSS solution and the one of 
the perturbation of the right hand side. Usually, a high frequency PSS solution 
needs many time points for a proper time-domain representation and to be able to 
study the effects of the perturbations to sum and difference frequencies /osc ±/pert- 
The reader is referred to [9, 26], where MOR is studied for systems arising in PAC 
analysis. 

Not surprisingly, the development of robust and efficient methods for electronic 
circuit simulation has taken a considerable number of years (since the 1970s). 
Even nowadays it is certainly not a trivial task, with new developments still being 
published, and new simulation software containing the latest advances in numer- 
ical methods being brought to the market. The additional benefits of using MOR is 
also a key topic in recent years. 
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At a lower level, we encounter the simulation of individual semiconductor 
devices. This is again a complicated mathematical problem. Often, one uses the 
model consisting of three coupled partial differential equations, together consti- 
tuting the drift-diffusion model. The system is singularly perturbed, meaning that 
special discretization techniques are needed to obtain accurate solutions on rela- 
tively coarse meshes. The resulting discrete system of equations is again extremely 
nonlinear, requiring special numerical methods for their solution. Damped Newton 
methods were developed, as well as nonlinear block Gauss-Seidel methods, the 
latter being known in the field as Gummel's method. Nonlinear variable trans- 
formations have also been shown to provide successful solution techniques. In the 
1980s and 1990s, many researchers have been working on this problem, both from 
academia and within large electronics companies like IBM, AT&T, and Philips. 
Although the set of methods for the drift-diffusion equations is by now fairly well 
accepted and stationary [39], this is certainly not the case for the extended models 
such as the energy transport model or the hydrodynamic set of equations [2]. 

Although electronic circuit simulation and semiconductor device simulation are 
now at a fairly mature level, new problems have occurred in the mean time. At 
high frequencies, electromagnetic effects in the interconnect structure (see 
Fig. 1.5) may start to influence the behavior of circuits and devices. This leads to 
delay of signals, malfunctioning devices, and undesired effects in the substrate. 
The Maxwell equations need to be solved to address these issues, either by using a 
full wave solution using methods like FDTD, FIT, BEM, and FVM, or approxi- 
mate methods like PEEC [35]. The resulting extremely large systems must be 
reduced to acceptable size to be coupled with the underlying circuit. These are 
complex linear systems, which are difficult to solve at high frequencies. The 
eigenvalues depend on the frequency, and iterative methods have difficulties to 
solve such systems. 

An important problem for the discussion in subsequent sections is that, even if 
we would be able to solve such large systems by iterative or direct methods, circuit 



Fig. 1.5 Interconnect 
structure above the silicon 
containing the circuit and 
devices 
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simulation programmes will not be able to cope with these large systems. The 
problem is that a coupled simulation needs to be carried out, and the commonly 
used way to obtain this coupling is to translate the discretized electromagnetic 
system into a linear circuit, and solve this in conjunction with the original non- 
linear electronic circuit using a circuit simulator. As this is not a feasible approach, 
one needs to first reduce the electromagnetic system to an acceptable size. This is 
one of the reasons why model order reduction has received much attention within 
the electronics industry. Reducing the electromagnetic system is indeed possible, 
as we are only interested in the dominant electromagnetic effects that influence the 
underlying circuit. Therefore, not all detail contained in the full discretized system 
is needed, and the use of model order reduction is fully justified. 

Model order reduction is not only used for the reduction of the electromagnetic 
system generated by the interconnect structure, but also by the substrate region 
below the individual semiconductor devices. This area of a chip also needs to be 
CO- simulated with the electronic circuit, in order to account for signal propagations 
taking place within the substrate region. Often, this leads to fairly simple resistor 
or RC networks, the problem being that these systems are again very large. Hence, 
model order reduction is needed to reduce these substrate systems to acceptable 
size, so that they can also be co-simulated with the electronic circuit. 

Having demonstrated the need for model order reduction, in the remainder of 
this paper we will address several issues that are encountered when performing this 
task. Once again, we stress that the discussion is not exhaustive. There are many 
papers describing the research on suitable MOR methods used in the electronics 
industry, and the reader is referred to these for more information. 



1.3 Passivity and Realizability 

As described in the introduction, the Pade-via-Lanczos method [13] was the 
starting point for many new developments in numerical methods for model order 
reduction. As we also discussed, it was soon realized that the method can generate 
non-passive reduced order models despite the fact that the original system was 
passive. The search for provably passive methods was started, and methods like 
PRIMA [32] and Laguerre-type methods [27, 30] are now routinely used. Thus, the 
problem appears to have been solved satisfactorily, and no further research is 
needed. 

However, the latter is not entirely the case. The first problem encountered in 
practice is the possibility that the discretisation of the original problem does not 
guarantee a passive system. This may be caused in electromagnetic simulations, 
for example, by a mesh that is too coarse. Another example is when using the 
boundary element method for the simulation of printed circuit boards, when the 
fourfold integrals have not been calculated to sufficient accuracy (this is especially 
crucial for the diagonal elements, which contain singular integrals). In these cases, 
there is little hope that the reduced system will be passive. 
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A related problem is that the reduced order model is solely based upon 
experimentally obtained results, for example measurements of S-parameters. 
Again, due to inaccuracy in the data, the state space system constructed from these 
may have poles in the wrong halve of the complex plane, and be non-passive. 
Again, this leaves not much hope for passivity of the reduced order system that 
should capture the dominant behavior of the experimentally generated data (see 
also [28]). 

It should be noted that a tendency is observed to increasingly perform MOR on 
data obtained fully or partially from experiments. Antoulas and his group [31] 
have done work on Loewner matrices and shifted Loewner matrices, that can 
conveniently be used to generate reduced order models based on available data. 
These methods split the data into two disjoint sets, and then perform a computa- 
tional procedure using these so-called Loewner matrices. The data may come from 
measurements, and can also be combined with simulation results. In fact, one 
could also perform a huge number of simulations, and use these for the set of data. 
These Loewner approaches lead to linear state space models that interpolate 
exactly [5, 28]. 

From the foregoing it is clear that passivity is certainly not a closed chapter in 
the book of MOR! However, the emphasis has shifted from provably passive 
methods to techniques for passivity enforcement. This topic is receiving quite 
some attention in recent years. It is also an important topic, as it is essential that 
rational models be passive in order to avoid unstable time domain simulations such 
as shown in Fig. 1.6. Thus, several researchers are concentrating on passivity 
enforcement methods, some from the numerical analysis point of view, others in 
the area of dynamical systems and control. For example, in [19], building on 
earlier work (2004), a general class of a posteriori passivity enforcement algo- 
rithms is considered based on iteratively perturbing the Hamiltonian matrix. The 
authors present a frequency-weighting scheme leading to the definition of a 
modified Gramian that, when employed during passivity enforcement, effectively 
leads to control of the relative error. 



Fig. 1.6 Unstable time 
domain simulation for non- 
passive reduced order model 
of a low pass filter (see also 
[44]) 
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A different technique is described in [23], where a fast approach for passivity 
enforcement of pole-residue models is obtained by perturbing the eigenvalues of 
the residue matrices, as opposed to the existing approach of perturbing matrix 
elements. This leads to large savings in computation time with only a small 
increase of the modeling error. This fast residue perturbation (FRP) approach is 
merged with the modal perturbation technique, leading to fast modal perturbation 
(FMP). Usage of FMP over FRP achieves to retain the relative accuracy of the 
admittance matrix eigenvalues. A complete approach is obtained by combining the 
passivity enforcement step with passivity assessment via the Hamiltonian matrix 
eigenvalues and a robust iteration scheme, giving a guaranteed passive model. 

Also worth mentioning is the work in [36]. A nice and recent survey of pas- 
sivity enforcement methods is provided in [20], and also at the recent SPI con- 
ference [25] interesting new developments were presented. There are, however, 
some remaining issues that are important from the electronics industry point of 
view. Most methods developed so far trade accuracy for passivity. In other words, 
the reduced order model can be made passive only at the expense of lower 
accuracy. This is not surprising, as the model was constructed in such a way that 
its transfer function gives a sufficiently accurate approximation of the transfer 
function of the original system. By changing the location of only a few poles, 
namely those causing the non-passivity, the accuracy may be seriously affected. 
A 5% change in the S-parameters (to obtain values not exceeding 1) will often lead 
to at least 5% loss of accuracy. This is not acceptable from an engineering point of 
view. The only solution for this problem appears to be an optimization approach 
that specifies the constraints, and changes the location and residues of all poles, 
until both passivity and sufficient accuracy are obtained. Techniques suggested so 
far avoid this optimization problem by performing iterations on a limited number 
of poles, but we feel it is necessary to invest in the more expensive approach. 
If this is achieved, then the chapter of passivity can be closed, and the engineering 
community within the electronics industry will also be much happier. 

A topic that is closely related to passivity is that of realizability. A well known 
theorem tells us that passive systems are automatically realizable. In his thesis, 
Brune (1931) shows that any passive transfer function can be realized by some 
passive network. He also provides rules for the construction of such networks, a 
drawback being that he is forced to employ transformers with perfect coupling. 
From an engineering point of view, this is undesirable, and it is more attractive to 
use so-called Foster synthesis, dating back to Foster (1924). 

Although this may, therefore, appear to be a problem solved long ago (see also 
[21] for a survey dating back to 1957), again this is not the case at closer 
inspection. Foster realization (or similar techniques) often leads to an extremely 
large number of resistive, capacitive and/or inductive elements. Thus, the size of 
the reduced order model is dramatically increased after realization. This is mainly 
due to the fact that the matrices in the reduced order model often have lost sparsity, 
so that all unknowns in the reduced order model are mutually coupled. Indeed, this 
dense structure is one of the stumbling blocks for efficient circuit synthesis of 
reduced order models. Other obstacles are that there may be negative values of 
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resistors, capacitors and inductors, and, last but absolutely not least, that the 
methods often work only for single input single output (SISO) systems. For 
multiple input multiple output (MIMO) systems, the approaches often fail, or have 
not even been described for this case. 

Recently, a method that makes use of "un-stamping" was published by Yang 
[50]. The method is termed RLCSYN, and heavily relies on block structure pre- 
serving MOR methods (see next section). This may indeed be the only way to 
avoid the mutual coupling between all elements in the reduced order system. So 
far, however, experiences with the method are lacking, and time will tell whether it 
is the solution to the aforementioned stumbling blocks. 

A point of discussion in the electronics industry is, of course, whether or not 
one needs to realize/synthesize the reduced order models. The reasoning is simple: 
suppose we have an electronic circuit, and are able to reduce it using mathematical 
techniques. If we perform the intermediate step of realizing the reduced order 
model, we obtain a smaller circuit that can be treated by a circuit simulator. But 
the circuit simulator will immediately start "discretizing" the circuit, translating 
the equations back to the mathematical reduced order model. Hence, it appears 
unnecessary to make the effort of realizing and/or synthesizing a reduced order 
model. However, there is a need within the electronics industry. A synthesized 
reduced order model can provide more insight to engineers and designers than the 
reduceld order model in mathematical form. In fact, as we shall see in Sect. 1.7, 
the compact models constructed by device physicists for transistors and diodes are 
translations of exactly these insights. Thus, there is a need in the industry for such 
realization and synthesis techniques. It would be even better if this could be done 
for parameterized reduced order models 



1.4 Structure Preservation 

When the discussions about passivity were drastically reduced after the publication 
of PRIMA and the Laguerre-type methods, a new topic emerged in the MOR 
community. In fact, this was initiated by the needs of the electronics industry, 
where it is a given that electronic circuits are formulated in terms of voltages and 
currents. Hence, the coefficient matrices in the state space systems can be regarded 
as consisting of blocks that correspond to these two types of unknowns. Unfor- 
tunately, conventional MOR methods do not make use of this knowledge. In fact, 
Krylov subspace methods like PVL immediately destroy this underlying structure, 
by mixing information that is of "current type" with that of "voltage type". This 
implies that the unknowns in the reduced order model do not have a physical 
interpretation. 

Clearly, the foregoing is rather undesirable, and there is a clear need in the 
electronics industry to maintain the underlying structure of the problem within the 
context of MOR. This has led to new methods that are now termed structure- 
preserving MOR methods. The first to be published was that of Roland Freund, 
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termed SPRIM [17]. The idea is simple, yet very clever: first, a Krylov subspace is 
generated using conventional MOR methods. However, rather than using the 
vectors generated as a basis for the Krylov subspace, Freund suggests to split each 
vector into two vectors, by splitting it into a vector containing only voltage 
unknowns, and a vector containing only current unknowns. In the vector con- 
taining the current unknowns, the voltage entries are all set to zero, and similarly 
for the vector containing the voltage unknowns. In this way, a basis is formed that 
consists of twice the number of basis vectors as the basis for the space constructed 
originally. The big advantage is, however, that it can now be shown that there will 
be no mixing of voltages and currents, meaning that it should be possible to assign 
a physical meaning to each of the vectors. 

Unfortunately, despite the nice properties of SPRIM (such as matching double 
the number of moments, see [17]), the method was not felt by the industry to be the 
ultimate answer to the question of structure preservation. In fact, it was soon 
noticed that the incidence matrix, i.e. the matrix that relates voltages and currents, 
is destroyed completely. Originally, it is a very sparse matrix with at most two 
elements per row (1 and —1, relating branches and nodes), but using SPRIM it 
immediately turns into a dense matrix. This means that all currents and voltages in 
the reduced system are coupled, which is to be avoided in view of subsequent 
realization and synthesis (see Sect. 1.3). 

Only recently, a remedy for the aforementioned problem has been presented. 
The method in [6] uses a clever transformation of variables before the reduction 
process starts, and then uses SPRIM. The transformation used is that which 
translates branch currents, arising in the frequently used MNA formulation of 
electronic circuits, into loop currents. Such transformations are also very helpful in 
methods for the solution of indefinite linear systems, as has been shown in [38]. In 
fact, long time ago, circuit simulation was usually done using the so-called "mesh 
method", which is precisely using loop currents rather than branch currents. With 
the advent of fast computers, the preference became MNA and using branch 
currents, but from a mathematical point of view, this is certainly not the preferred 
choice. In any case, Bai et al. [6] make use of the loop currents, the big advantage 
being that the incidence matrix becomes an identity matrix. That is, the (l,2)-block 
in the matrix will be identity, the corresponding (2,1) block is not affected. Hence, 
we immediately see a drawback of the method, namely it destroys the original 
symmetry. 

The method proposed by Bai et al. [6] is certainly promising, and in a further 
publication it has been shown to be capable of addressing the realization problem 
discussed in the previous section. However, it is too early to state that all problems 
have been solved. There is still the clear need in the electronics industry to address 
the structure preservation issue, and it is the feeling of the author that other 
methods need to be explored. For example, when considering an electronic circuit, 
it usually contains more branches than nodes. This means that we have more 
unknowns of the type "current" than of the type "voltage". Structure preserving 
methods suggested so far generate the same number of basis vectors of both types. 
This seems unnatural. Adding a new basis vector for the voltages, should probably 
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Fig. 1.7 Structure 
preserving MOR for coupled 
systems 
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lead to a number of basis vectors added for the currents. So that, in the end, having 
a full basis for the voltages at the same time leads to a full basis for the currents. In 
the presently used methods, this is not the case. Most probably, the answer needs 
to be found in graph theory, but currently it is not known how to do this. 

In the foregoing, we have concentrated on electronic systems containing cur- 
rents and voltages, reflecting a structure consisting of two types of unknowns. Of 
course, the structure may be much more refined. From a theoretical point of view, 
we could have interconnected systems as considered by Vandendorpe and 
Van Dooren (see [37], Chap. 14). More general (but very much related to that of 
Vandendorpe and Van Dooren) is the approach considered in [14]. The graphical 
interpretation shown in Fig. 1.7 clearly shows the idea: keep the structure of the 
original systems, and do not destroy it during the MOR phase. The picture also 
shows the drawback of the method developed so far, namely that the coupling 
blocks become dense after reduction. Here, a combination with the ideas of Bai 
et al. [6] could provide a solution. However, as before, we feel that other methods, 
possibly from graph theory, need to be investigated in order to arrive at a fully 
acceptable solution. 

Summarizing, the problem of structure preservation remains unsolved, despite 
various useful attempts. The feeling within the electronics industry is that novel 
concepts and ideas are needed. 



1.5 Reduction of JMIIVIO Networks 



Although not mentioned explicitly, so far the discussion has been limited to sys- 
tems that are of single input single output type. Of course, several of the methods 
presented can also be applied to the MIMO case, but often this will reveal some 
hidden problems. One very clear disadvantage of Krylov subspace methods 
applied to MIMO systems is that the number of basis vectors added to the subspace 
is equal to the number of inputs and outputs of the system. In the SISO case, one 
vector is added at the time, but in the MIMO case, many vectors are added in each 
iteration. Thus, the dimension of Krylov subspaces quickly becomes large, in 
many cases even prohibitively large. In [27], this problem has been analyzed, and a 
block-Arnoldi algorithm was proposed that reduces the blocks of vectors that is 
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added per iteration. Experiments conducted with this method show the advantage, 
but it is too marginal to be very effective in practical situations. Hence, new 
concepts need to be developed. 

One new concept that has recently been investigated, and published in another 
chapter in this book, is the use of a mixture of graph theoretical and numerical 
methods to perform some kind of "preconditioning" of the original system. In fact, 
it is similar to the successive node elimination (SNE) technique [41]. Instead of 
applying Krylov subspace methods to the system, related to iterative solution 
methods for linear systems, the method employs a more direct approach of 
eliminating unknowns, corresponding to direct solution techniques for linear 
systems. We do not wish to repeat the information that is in the chapter elsewhere 
in this volume, and refer the reader to it for more details. What can be said is, 
however, that there is a very clear and relatively urgent need by the electronics 
industry to be able to drastically reduce such networks with many so-called 
"ports". As has been demonstrated in Sect. 1.2, both for the interconnect system 
and for the substrate, huge resistive (and sometimes RC networks) are generated 
that need to be co-simulated with the underlying electronic circuit. This is only 
feasible in present-day circuit simulators if these huge networks are reduced very 
substantially. 



1.6 MOR for Delay Equations 

A fairly recent development is concerned with delay extraction-based passive 
macromodeling, associated with multiconductor transmission line type intercon- 
nects. High-speed effects influencing a signal propagating on an interconnect could 
be multifold, such as delay, rise time degradation, attenuation, crosstalk, skin 
effect, overshoots, undershoots, ringing, and reflection. Concerning propagation 
delay, a signal traversing from one end of a transmission line to the other end takes 
a finite amount of time, resulting in the signal experiencing a certain amount of 
delay. In addition, the signal may encounter rise time degradation, where the rise 
time at the receiver end is larger than the rise time at the source end. Rise-time 
degradation further adds to the overall delay experienced by the signal. 

Concerning signal reflection and the associated ringing, this can also severely 
distort signal propagation at higher frequencies. The prime cause of this is the 
discontinuity in characteristic impedance of the transmitting line, due to the 
impedance variation on a line taking place over a certain length. This can occur 
due to the change in the medium along the length of the signal trace, which may 
have to traverse several layers. 

It turns out that conventional model order reduction techniques do not work 
well for this case, characterized by multiport (Y, Z, S, or the transfer function) 
tabulated parameters. In [10] an algorithm is described that approximates the 
logarithm of the transfer function using a low-order rational function. Subse- 
quently, the DEPACT (delay extraction-based passive compact transmission line 
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macromodel) algorithm is applied to obtain a passive and causal macromodel for 
SPICE simulation. This method leads to compact, low-order macromodels that are 
more reliable for time-domain simulations. 

More recent developments can be found in [25], where several papers discuss 
the issue of MOR for delay equations. The most commonly used approach now- 
adays appears to be an adapted version of vector fitting. Rather than using only 
rational functions, it is suggested to have an expansion which is a mixture of 
exponential and rational terms. The exponential terms then account for the delay, 
and can account also for reflections. The "coefficients" of these exponential terms 
are then rational functions as used in the vector fitting approach. 

Although simulations carried out so far suggest that the foregoing approach is 
the way to go, there is also the clear disadvantage that the methods are only suited 
for delay equations. Stated differently: if one includes the exponential terms, one 
can account for effects in long interconnect lines, but this expansion is not very 
suitable for short lines. If one does not include the exponential terms, the normal 
vector fitting approach is used, and one will find that a prohibitively large number 
of terms is needed in the expansion when used for long interconnect lines. Hence, 
the essential question is, whether a combined approach can be found that addresses 
both long and short interconnect lines. This is a clear need in present-day simu- 
lations of interconnect structures, also for extraction programmes [24]. 



1.7 Parameterized and Nonlinear JVIOR 

As will have become clear from the foregoing sections, the state-of-the-art con- 
cerning MOR for linear problems is relatively well developed, although there are 
still many open questions and remaining problems. For parameterized and non- 
linear problems, model order reduction techniques are still in their infancy. 

When looking at parameterized MOR, the electronics industry has clear needs. 
Modern analog designs are often built using the concept of programmable cells 
(PCells, PCircuits) and the ROD/Skill programming language. All of these 
introduce parameters, either in the definition of the circuit sub blocks (Pcells), or 
the relation between sub blocks (ROD, relative object design). Often, these sub 
blocks are optimized with respect to a certain objective function on the so-called 
schematic level, but optimization needs to be performed again on the higher layout 
level, in order to account for parasitic effects that might take place due to inter- 
actions between the different components. Clearly, these problems cannot be 
addressed by an MOR technique that only uses numbers. We need to be able to 
perform MOR also when unknown parameters are in the system. 

So far, several researchers have addressed this problem, but the techniques 
suggested are often not very general, or not capable of solving the aforementioned 
complex problems. Obvious extensions using assumptions on the linearity are 
often suggested, whereas what is needed are new concepts. Rather than using 
Taylor expansions in the parameters, leading to a multitude of cross terms, it is the 
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Fig. 1.8 Dependence of 
poles on parameters 
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feeling of the author that we need a careful analysis of the poles of the transfer 
function in terms of the parameters. A development in this direction that is felt to 
be very promising is the sensitive pole algorithm (SPA), that shows the depen- 
dence of the location of poles on parameters. In Fig. 1.8 an example is shown (see 
[34]). 

It is exactly this dependence that is important. For the Maxwell equations, for 
example, part of the eigenvalue spectrum appears to be independent of the 
parameter "frequency", whereas another part is heavily dependent on the fre- 
quency. Hence, it should be possible to develop MOR methods that make use of 
this knowledge, and only consider the poles that are most sensitive to changes 
of parameters. This would be the author's advice to researchers working in the area 
of parameterized MOR. The electronics industry would be extremely delighted 
when better methods, not as involved as the methods suggested so far, would "see 
the light". A potentially very interesting development is that using symbolic 
approximations. The reader is referred to [40, 48] for more information. 

For nonlinear problems, the situation is similar to that of parameterized MOR. 
Some methods have been suggested, but often these are based on linearizations of 
the underlying problem. This can never be effective for general nonlinear prob- 
lems, and is far from the needs of the electronics industry. In the area of Krylov 
subspace type methods, the trajectory piecewise linear method (TPWL) (Fig. 1.9) 
appears to be most popular; see also the chapter by Striebel in this volume. That 
chapter contains a wealth of information on the state-of-the-art of model order 
reduction for nonlinear problems [42]. In the area of dynamical systems and 
control, methods developed by Fujimoto and Scherpen [18] and Verriest [46] are 
promising, although so far the methods are limited to small nonlinear systems. 

The question is, however, whether one can expect to be able to develop MOR 
methods for general nonlinear systems. In the author's opinion, larger nonlinear 
systems can only be tackled by using specific information about the underlying 
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Fig. 1.9 Schematic 
illustration of the TPWL 
method 
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nonlinear system. A nice example is provided by the simulation of voltage con- 
trolled oscillators. These are devices that are used frequently in electronic designs. 
Initiated by Roychowdhury [12], tremendous achievements have been obtain in the 
reduction of such systems. In Fig. 1.10 a simple oscillator schematic is shown, 
whereas in Fig. 1.11 the behaviour is shown in case it is in the so-called unlocked 
state. 

The theory developed for oscillators is based on insight in the behaviour of such 
devices. The circuit describing these oscillators often consist of about 100 com- 
ponents, but their behaviour can be reduced to a single equation describing the 
phase noise. In fact, reduction to a single equation is possible as one knows that 
only the eigenvalue 1 plays a role in the final behaviour, all other modes will damp 
out. This example illustrates that it can be very effective to use detailed knowledge 
in order to obtain an adequate and effective reduction of the nonlinear system. 

The electronics industry needs more of these specific ideas for the reduction of 
nonlinear components. In fact, physicists and device engineers have performed 
model order reduction already since the 1980s, but their activities are termed 
"compact modeling" rather than reduced order modeling. Initially, compact 
models were made in the form of so-called Gummel-Poon models (see Fig. 1.12). 

More generally, the idea of compact modeling is to capture the full nonlinear 
behaviour of a semiconductor device into a model consisting of resistors, capac- 
itors, inductors and simple nonlinear components like diodes, containing many 
parameters such as channel width, channel length, and doping profile. Compact 
modeling has become a discipline in itself, a good and solid reference being [29]. 
In recent years, efforts have mainly concentrated on compact models for the 
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Fig. 1.11 Frequency domain 
behaviour of unlocked 
oscillator 
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Fig. 1.12 Gummel-Poon 
model used in compact 
modeling of semiconductor 
devices 




various generations of MOS transistors, building blocks of most electronic circuits 
nowadays. The Penn State Philips (PSP) model is the most recent model, now 
being accepted as the world standard [1]. 

It would be great if the MOR community could provide ideas for the automatic 
generation of such compact models. Thus far, such models have been constructed 
manually, using much insight and a huge number of experimental results being 
generated over years. In view of the latter, it is to be expected that successfully 
generated reduced order models must also make use of huge numbers (millions) of 
measured or simulated (via semiconductor device simulation) results. 
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A final remark is that the work presented in [11, 46, 47] is also of very much 
interest in the context of nonlinear MOR, and might provide the breakthroughs the 
industry is waiting for. 



1.8 Summary: Present and Future Needs of the Electronics 
Industry 

In this chapter, we have touched upon many subjects within the field of model 
order reduction as viewed from the electronics industry. The conclusion is that the 
area of linear problems is fairly well developed, but that many problems still 
remain, with new research topics being generated regularly. In the areas of 
parameterized and nonlinear model order reduction, many problems remain, and 
these areas still need many new ideas in order to get the status of being mature. 
Specific topics that we identified in this paper, and are worth stressing again: 

• Good and reliable error estimates for Krylov subspace based MOR techniques 

• Passivity enforcement in such a way that accuracy is not traded for passivity 

• More natural structure preserving methods that use different spaces for currents 
and voltages 

• Efficient reduction methods for systems with multiple inputs and outputs 

• An MOR approach that can address both long and short interconnect lines, i.e. is 
suitable for situations with and without delay 

• Parameterized model order reduction based on the sensitive pole algorithm 

The ultimate challenge, however, is to be able to automatically generate the 
nonlinear device models that can currently only be constructed by device physicists ! 
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Chapter 2 

The SPRIM Algorithm for Structure- 
Preserving Order Reduction of General 
RCL Circuits 



Roland W. Freund 



Abstract In recent years, order-reduction techniques based on Krylov subspaces 
have become the methods of choice for generating macromodels of large-scale 
multi-port RCL networks that arise in VLSI circuit simulation. A popular method of 
this type is PRIMA. Its main features are provably passive reduced-order models and 
a Pade-type approximation property. On the other hand, PRIMA does not preserve 
other structures inherent to RCL circuits, which makes it harder to synthesize the 
PRIMA models as actual circuits. For the special case of RCL circuits without 
voltage sources, SPRIM was introduced as a structure-preserving variant of PRIMA 
that overcomes many of the shortcomings of PRIMA and at the same time, is more 
accurate than PRIMA. The purpose of this paper is twofold. First, we review the 
formulation of the equations characterizing general RCL circuits as descriptor 
systems. Second, we describe an extension of SPRIM to the case of general RCL 
circuits with voltage and current sources. We present some properties of the general 
SPRIM algorithm and report results of numerical experiments. 

Keywords Krylov subspace • Structure preservation • Reduced-order model • 
Passivity • Electronic circuit • Interconnect 



2.1 Introduction 

The problem of order reduction of linear dynamical systems has a long history and 
many powerful methods for generating provably good reduced-order models have 
been developed. Most of this work was motivated by problems in control theory. 
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where, typically, the sizes of the dynamical systems to be reduced are relatively 
small to begin with. In the early 1990s, the continued miniaturization ('Moore's 
Law') of VLSI circuits lead to the need for efficient order reduction of large-scale 
linear dynamical systems with ever-increasing state-space dimension. The linear 
dynamical systems to be reduced in VLSI circuit simulation are RCL models of 
the VLSI circuit's interconnect system. These RCL models are relatively 'simple', 
but large, electronic networks with resistors, capacitors, inductors, voltage sources, 
and current sources as their only elements. Unlike the typical order-reduction 
problems in control theory, the large state-space dimension of these RCL circuits 
made the use of most of the more powerful reduction methods prohibitive until 
recently. Furthermore, RCL circuits are described by systems of differential- 
algebraic equations, resulting in so-called descriptor systems. Many of the existing 
order-reduction methods could not be used directly for descriptor systems at that 
time. Motivated by this 'new' application in VLSI circuit simulation, starting in 
the early 1990s, there has been tremendous interest in developing techniques for 
order reduction of large-scale descriptor systems.' 

Reduced-order modeling techniques based on Pade or Pade-type approximation 
were quickly recognized to be powerful tools for the problems arising in VLSI 
circuit simulation. The first such technique was asymptotic waveform evaluation 
(AWE) [27], which uses explicit moment matching. More recently, the attention 
has moved to reduced-order models generated by means of Krylov-subspace 
algorithms, which avoid the typical numerical instabilities of explicit moment 
matching; see, e.g., the survey papers [12-14]. 

PVL [8, 9] and its multi-port version MPVL [10] use variants of the Lanczos 
process [23] to stably compute reduced-order models that represent Pade or 
matrix-Pade approximations [4] of the circuit transfer function. SyPVL [18] and its 
multi-port version SyMPVL [11, 19, 20] are versions of PVL and MPVL, 
respectively, that are tailored to RCL circuits. By exploiting the symmetry of RCL 
transfer functions, the computational costs of SyPVL and SyMPVL are only half 
of those of general PVL and MPVL. 

Reduced-order modeling techniques based on the Arnoldi process [2], which is 
another popular Krylov-subspace algorithm, were first proposed in [7, 24-26, 31]. 
Arnoldi-based reduced-order models are defined by a certain Pade-type approxi- 
mation property, rather than Pade approximation, and as a result, in general, they 
are not as accurate as a Pade-based model of the same size. In fact, Arnoldi-based 
models are known to match only half as many moments as Lanczos-based models; 
see [13, 24, 25, 31]. 

In many applications, in particular in VLSI interconnect analysis, the reduced- 
order model is used as a substitute for the full-blown original model in higher-level 
simulations. In such applications, it is very important for the reduced-order model 
to maintain the passivity properties of the original circuit. In [3, 19, 20], it is shown 



See Chap. 3 for recent progress in the adaptation of control-theoretic methods to large-scale 
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that SyMPVL is passive for RC, RL, and LC circuits. However, the Pade-based 
reduced-order model that characterizes SyMPVL cannot be guaranteed to be 
passive for general RCL circuits. On the other hand, in [24-26], it was proved that 
the Arnoldi-based reduction technique PRIMA produces passive reduced-order 
models for general RCL circuits. PRIMA employs a block version of the Arnoldi 
process and then obtains reduced-order models by projecting the matrices defining 
the RCL transfer function onto the Arnoldi basis vectors. While PRIMA generates 
provably passive reduced-order models, it does not preserve other structures, such 
as reciprocity or the block structure of the circuit matrices, inherent to RCL 
circuits. This has motivated the development of the reduction technique SPRIM 
[15], which overcomes these disadvantages of PRIMA. In particular, SPRIM 
generates provably passive and reciprocal macromodels of multi-port RCL cir- 
cuits. Furthermore, SPRIM models match twice as many moments as the corre- 
sponding PRIMA models obtained with identical computational work. 

SPRIM was originally proposed in [15] for the special case of RCL circuits 
without voltage sources. The purpose of this paper is twofold. First, we review the 
formulation of the equations characterizing general RCL circuits as descriptor 
systems. Second, we describe an extension of SPRIM to the case of general RCL 
circuits with voltage and current sources. We present some properties of the 
general SPRIM algorithm and report results of numerical experiments. 

The remainder of this paper is organized as follows. In Sect. 2.2, we review the 
equations describing general RCL circuits and the formulation of these equations 
as descriptor systems. In Sect. 2.3, we present some basic facts about generating 
reduced-order models of descriptor systems by means of Krylov subspace-based 
projection. In Sect. 2.4, we describe the SPRIM algorithm for the case of general 
RCL circuits. In Sect. 2.5, we make some comments about how to reduce the 
number of voltage sources before applying SPRIM to the remaining RCL net- 
work. In Sect. 2.6, we report the results of some numerical experiments. Finally, 
in Sect. 2.7, we mention some open problems and make some concluding remarks. 

Throughout this paper the following notation is used. The set of real and com- 
plex numbers is denoted by R and C, respectively. For (real or complex) matrices 
M — [mjk] , we denote by M^ — [mtj] the transpose of M, and by M^ :— [ffikj] the 
Hermitian (or complex conjugate) of M. The identity matrix is denoted by / and 
the zero matrix by 0; the actual dimensions of / and will always be apparent from 
the context. The notation M ^ (M >- 0) is used to indicate that a real or complex 
square matrix M is Hermitian positive semidefinite (positive definite). If all entries 
of the matrix M ^ (M >- 0) are real, then M is said to be symmetric positive 
semidefinite (positive definite). 

2.2 RCL Circuit Equations 

We consider general RCL circuits driven by voltage and current sources. In this 
section, we review the formulation of such RCL circuits as linear time-invariant 
differential-algebraic equations (DAEs). 
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2.2.1 The Lumped-Element Approach 

The lumped-element approach uses directed graphs to model electronic circuits; 
see, e.g., [6, 28, 30, 32 ]. The edges £ of the graph correspond to the electronic 
elements of the circuit and the nodes Af of the graph correspond to the inter- 
connections of the electronic elements. In this subsection, we review this approach 
for the case of general RCL circuits driven by voltage and current sources. 

The elements of such a general RCL circuit are its resistors, capacitors, 
inductors, voltage sources, and current sources. With each such element we 
associate an edge e ^ £, written as an ordered pair of nodes: 

e = (ni,M2)- 

Here, ni,n2 G A/^ are the nodes of the graph representing the interconnections of 
the element to other circuit elements. We call «i the tail node of e and M2 the head 
node of e. Note that the direction of e is from «i to ^2. For circuit elements for 
which the direction of the electric current through the element is known before- 
hand, the direction of e is chosen accordingly. For all other elements, an arbitrary 
direction of e is assigned. If the computed electric current through such an element 
is nonnegative, then the current flow is in the direction of e\ otherwise, the actual 
current flow is against the direction of e. 

The resulting directed graph Q = {M , £) can be described by its incidence 
matrix the rows and columns of which correspond to the nodes nj £ J\f and edges 
ek G £, respectively. To this end, we set 

1 if edge e^ leaves node «,-, 
ajk = { —^ if edge e^ enters node m,, 
otherwise. 

The rows of this matrix add up to the zero row, and thus the matrix is rank 
deficient. In order to avoid this redundancy, we label one of the nodes as the 
ground node of the circuit and delete the corresponding row. We call the resulting 
matrix A. It has \I\f\ — 1 rows and \£\ columns, where \J\f\ and \£\ denote the 
number of nodes and edges of the graph Q, respectively. If Q (viewed as an 
undirected graph) is connected, then the matrix A has full row rank: 

xim\.A= \U\- 1. 

Note that Q is indeed connected for any real electronic circuit, and thus, this 
condition of full row rank of A is always satisfied. 

The matrix A allows an elegant formulation of the Kirchhoff's laws for the given 
RCL circuit. To this end, we denote by i^ the vector the entries of which are the 
currents along the edges £, by vg the vector the entries of which are the voltages 
across the edges £, and by v the vector the entries of which are the voltages at the 



2 The SPRIM Algorithm for Stmcture-Preserving Order Reduction 



29 



nodes A^, except for the ground node at which the voltage is zero. We remark that 
ig and vg are vectors of length \£\ and v is a vector of length \M\ — 1. Kirchhojf's 
current laws (KCLs) can then be stated compactly as follows: 

Ak - 0. (2.1) 

Kirchhojf's voltage laws (KVLs) have the following compact formulation: 

A^v = V£. (2.2) 

To obtain a complete characterization of the given RCL circuit, the so-called 
branch constitutive relations (BCRs) for the actual circuit elements need to be 
added to the Kirchhoff's laws (2.1) and (2.2). To formulate the BCRs, it is 
convenient to assume that the edges e^ e £ are numbered according to element 
type in the following order: resistors, capacitors, inductors, voltage sources, and 
current sources. The matrix A and the vectors i^ and V£ can thus be partitioned as 
follows: 



v^ — I ./t./' •'t-c •'^Z J^y '•'*■{ \ ; ^S 



Vf 



(2.3) 



Here, the subscripts r, c, I, v, and ; refer to resistors, capacitors, inductors, voltage 
sources, and current sources, respectively. Using the partitions (2.3), the Kirchhoff's 
laws (2.1) and (2.2) can be written as follows: 



Ar ir + Ac ic + Ai ii + Av iv + Ai ii = 0, 

Aj.v = v,., A^v^vc, Ajv^v,, Alv^v,., Ajv . 



(2.4) 



Furthermore, the BCRs for the resistors, capacitors, and inductors can be stated in 
the following compact form: 

Vrit) = Ri,{t), i,{t) = C--Vc{t), vi{t)=L--ii{t). (2.5) 

at at 

Here, R and C are diagonal matrices, the diagonal entries of which are the resis- 
tances of the resistors and the capacitances of the capacitors, respectively, and in 
particular, R>- Q and C ^ Q. The matrix L contains the inductances between the 
inductors as its entries. If mutual inductances are included, then L is a full matrix; 
otherwise, L is also a diagonal matrix. In both cases, L^ Q. 
Therefore, the matrices in (2.5) always satisfy 



R>-Q, C ^ 0, and L ^ 0. 



(2.6) 
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2.2.2 RCL Circuit Equations as Integro-DAEs 

Equations 2.4 and 2.5, together with suitable initial conditions for some initial time 
fo, completely characterize the time behavior of the RCL circuit. Here, 

v„(0 and (,(f), t>to, (2.7) 

are given functions, which describe the voltages and currents provided by the 
voltage and current sources, respectively. The functions 

v(f), v,(f), v,(0, v,(f), V,(f), ir{t), i,{t), ii{t), U{t) (2.8) 

are the unknown solutions of the Eqs. 2.4 and 2.5. These equations can be greatly 
simplified by using the BCRs (2.5) and the formulae for v^, v^, and v; in (2.4) to 
eliminate all but the unknown functions v{t) and i^{t). 
To this end, we now assume that Iq — Q and that 

/,(0) = 0. (2.9) 

The third relation in (2.5) can then be rewritten in the form 

i,{t) ^ L-' f v,{z)dr. (2.10) 



Using (2.10), the first two relations of (2.5), and the formulae for v^, v^., and v; in 
(2.4), it follows that 

t 

;V(f) = R-^A^^t), i,{t) = CAljv{t), i,{t) ^ L-^Aj / v{z)dT. (2.11) 



Inserting these relations into (2.4), we obtain the coupled system of equations 

Enjv{t) - All v(0 + AiL-^Aj / v{T)dT + A,h{t) = -AMt), 



-Alv{t) = -n,(r) 
for the unknown functions v(t) and /,,(?). Here, we have set 

Eu:=AcCAl and Au := -ArR'^Aj. . (2.13) 

The system (2.12) is completed by adding initial values 

v(0) and (,(0) (2.14) 

for the solutions v{t) and (^(0 at initial time to = 0. 
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We remark that the system (2.12) with initial conditions (2.14) represents an 
integro-dijferential-algebraic equation (integro-DAE). This is the most compact 
formulation of the equations describing RCL circuits. By solving the integro-DAE 
(2.12), we obtain the vector v(?) of nodal voltages and the vector i^{t) of currents 
through the voltage sources. The remaining circuit quantities, namely 

Vr{t), Vcit), vi{t), v,(f), ir{t), ic{t) , and ii{t), 

are then readily computed using the KVLs in (2.4), the first two BCRs in (2.5), and 
(2.10). 



2.2.3 RCL Circuit Equations as Descriptor Systems 

While integro-DAEs of the type (2.12) are the most compact formulation of RCL 
circuit equations, they are not directly amenable to Krylov subspace-based 
reduction techniques. Instead, we rewrite (2.13) as a descriptor system. This can be 
done by treating the vector //(f) of inductor currents also as an unknown function, 
along with the functions v{t) and iy{t). We denote by 



x{t) 



v{t) 
ii{t) 
hit) 



(2.15) 



the resulting new state-space vector of unknowns. Furthermore, we define vectors 
of input and output functions 



u{t) 



v„(r) 



and y{t) 



v,(0 
~iv{t) 



(2.16) 



Recall from (2.12) that the entries of the input vector u{t) are all given functions. 
The output vector y(t) is readily obtained from the state-space vector (2.15). 
Indeed, using the relation v,(f) = Ajv{t) from (2.4), it follows that 



y{t) = B^x{t), where B := 



'Ai 














-/ 



(2.17) 



We now re-introduce ii{t) into (2.12). To this end, we employ the formula for 
(/(r) stated in (2.11) and the differentiated form thereof: 



Lj^ii{t)^A]v{t). 
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The resulting equivalent version of the system (2.12) can be stated as follows: 



Eu-v{t) - Auv{t) + Aiii{t) + AMt) 



-Aiii{t), 









Lj,iM 


-Ajvit) 
-Alv{t) 


= 0, 


0- 


Finally, setting 
















'En 


0" 






[All 


~A, 


~A 


E:^ 





L 


and A 


= 


Aj 










. 


0. 
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(2.18) 



(2.19) 



and adding the relation (2.17) and (2.18), we obtain the following formulation of 
the RCL circuit equations as a descriptor system: 

Ef^4t)=Mt)+Buit), (2.20) 

y{t)^B^x{t). 

The system (2.20) is completed by the initial conditions from (2.9) and (2.14): 

-v(0)- 
x(0) = xo := 

In the following, we denote by A^ and m the length of the state-space vector x{t) and 
the length of the input vector u(t) of (2.20), respectively. The number A^ is called 
the state-space dimension of the descriptor system (2.20). We remark that m is also 
the length of the vector-valued output vector y(t) of (2.20). In particular, A and 
E are N x N matrices, and B is an A' x m matrix. Note that, in view of (2.16), m is 
the total number of voltage and current sources in the RCL circuit. 

The formulation (2.20) is essential for using Krylov subspace techniques for 
model order reduction of RCL circuits. Furthermore, the matrices A and E in (2.20) 
need to be such that the matrix pencil 



sE 



s e 



(2.21) 



is regular, i.e., the matrix sE — A is singular only for finitely many values of 

Regularity of the matrix pencil (2.21) is equivalent to certain rank conditions 
involving the subblocks of the matrix A in (2.3). Indeed, the matrix pencil (2.21) is 
regular if, and only if, the matrix 

Av has full column rank (2.22) 



and the matrix 



[ Ar Ac Ai Ay ] has full row rank. 



(2.23) 



2 The SPRIM Algorithm for Stracture-Preserving Order Reduction 33 

For an elementary proof of this characterization of regularity, we refer the reader 
to [17, Theorem 1]. 

The rank conditions (2.22) and (2.23) have simple interpretations in terms of 
the RCL circuit described by the descriptor system (2.20). Condition (2.22) means 
that the subcircuit consisting of only the voltage sources of the given RCL circuit 
has no closed (undirected) loops. Condition (2.23) means that the (undirected) 
graph corresponding to the subcircuit obtained from the given RCL circuit by 
deleting all current sources is still connected. Both these conditions are satisfied 
for any practically relevant RCL circuit, and thus from now on, we always assume 
that the matrix pencil sE — A associated with the descriptor system (2.20) is 
regular. Furthermore, we stress that the occurrence of a singular matrix pencil is a 
strong indication that some error was made in the design or the modeling of the 
RCL circuit described by (2.20). 



2.2.4 Passivity 

Any RCL circuit is passive, i.e., it consumes energy (provided by the voltage and 
current sources), but does not generate energy. 

One of the possible mathematical characterizations of passivity uses the con- 
cept of the transfer function of a descriptor system. Recall that the matrix pencil 
(2.21) is assumed to be regular. Therefore, we can define an m x m-matrix-valued 
function by setting 

H{s) -.^B^isE-Ay^B, seC. (2.24) 

Note that His a rational function, with possible poles at those finitely many values 
of 5 e C for which the matrix sE — A is singular. The function (2.24) is called the 
transfer function of the descriptor system (2.20). 

The function (2.24), H, is said to be positive real if it has no poles in the right- 
half 

C+~{seC\Res>0} 

of the complex plane and 

H{s) + {H{s}f h for all seC+. 

It is well known that a system described by (2.20) is passive if, and only if, the 
transfer function of (2.20) is positive real, see, e.g., [1]. 

Since RCL circuits are passive, it thus follows that the transfer functions of the 
circuit equations stated as descriptor systems (2.20) are guaranteed to be positive 
real. An alternative proof of this fact uses the following simple theorem, which can 
be found as Theorem 13 in [13]. 
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Theorem 1 Let A,£ e M'^''"' and B e K.^'^'". Assume that 



(2.25) 



and that the matrix pencil sE — A is regular. Then, the function (2.24), H, is 
positive real. 

In view of this tiieorem, we only need to verify that the matrices A and 
E defined in (2.19) satisfy the conditions (2.25). Note that by (2.6) and (2.13), we 
have 

fill =£[1^0, L^L^^O, and Aii=A[i^O. 

Together with (2.19) it follows that 

E = E'^ >Q and A+A^ = ^0. 



All 





0" 





















Hence the transfer functions of RCL circuit equations stated as descriptor systems 
(2.20) are indeed positive real. 



2.3 Projection-Based Order Reduction 

In this section, we review some basic facts about generating reduced-order models 
of descriptor systems (2.20) by means of projection. From now on, we always 
assume that the system (2.20) describes a given RCL circuit. In particular, the 
matrices A and E are of the form (2.19), and the matrix B is of the form (2.17). 



2.3.1 Reduced-Order Models 

A general reduced-order model of (2.20) is a descriptor system of the same form 
as (2.20), but with a reduced state-space dimension n < N, instead of A'. Thus a 
reduced-order model is of the form 






(2.26) 



where A„, £„ e M'""' and B„ eR"""". Note that the input vector u(t) is the same as 
in (2.20). In particular, the number m is unchanged from (2.20). The output vector 
y{t) of (2.26) is only an approximation of the original output vector y{t) of (2.20). 
In fact, the problem of order reduction is to find a sufficiently large reduced state- 
space dimension n and matrices A,„ £„, and B„ such that the output vector of the 
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reduced-order model (2.26) is a 'sufficiently good' approximation of the output 
vector of the original system (2.20). 
Provided that the matrix pencil 

sEn-A„, see, (2.27) 

associated with (2.26) is regular, we can define a transfer function as before: 

H„{s) := bI{sE„~A„)-'b„, s G C. (2.28) 

In terms of transfer functions, the problem of order reduction is to find a suffi- 
ciently large reduced state-space dimension n and matrices A„, £„, and B„ such that 
the transfer function (2.28), //„, of the reduced-order model is a 'sufficiently good' 
approximation of the transfer function (2.24), H of the original system: 

H„{s) K, H{s) in 'some sense'. (2.29) 



2.3.2 Order Reduction Via Projection 

A very basic approach to constructing reduced-order models is to employ pro- 
jection. Let 

V„ e M'^^" with rank y„ = n (2.30) 

be given. Then, by simply setting 

An := VlAVn, E„ := Vj^V,,, and B„ := vJb, (2.31) 

one obtains a reduced-order model (2.26) that can be viewed as a projection of the 
A^-dimensional state space of the original system onto the n-dimensional subspace 
spanned by the columns of the matrix y„. In particular, projection employs an 
ansatz of the form 

~x{t) = VnX{t) 

for the state-space vector x{t) of the reduced-order model (2.26). Recall that 
x{t) denotes the state-space vector of the original system (2.20). 
There are two appealing aspects of the projection approach: 

1. Reduced-order models obtained by means of projection trivially preserve 
passivity of the original system; see, e.g., [24-26]. Indeed, the only additional 
condition on the matrix (2.30), V„, is that the resulting matrix pencil (2.27) is 
regular. Recall that A and E satisfy the semidefiniteness properties (2.25). The 
definitions of A„ and E„ in (2.31) readily imply that these matrices also satisfy 
(2.25). Therefore, by Theorem 1, the transfer function (2.28), H„ is positive 
real, and thus the corresponding reduced-order model (2.26) is passive. 
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2. By choosing the matrix V„ such that the subspace spanned by its columns 
contains a certain Krylov subspace, one obtains reduced-order models for 
which (2.28), H„, is a Pade-type approximation of the original transfer function 
(2.24), H; see Sect. 2.3.4 below. 



2.3.3 Block Krylov Subspaces 

Suppose we want to evaluate the transfer function (2.24), H, of the descriptor 
system (2.20) at some point sq e C at which the matrix ^o^^ — A is nonsingular. 
The most efficient way of obtaining H(sq) is to first solve the system of linear 
equations 

{sqE - A)X{sq) ^ B (2.32) 

for X{s()) and then compute 

H{so)=B''X{sq). 

The coefficient matrix sqE — A of (2.32) is large, but sparse, and as it is the case 
for all practical circuit matrices, a sparse LU factorization of this matrix can be 
computed with typically little fill-in; see, e.g., [30, 32]. 

The basic idea behind Krylov subspace-based order reduction for electronic 
circuits is to reuse the LU factorization, which was computed to obtain H{so), to 
generate the information contained in the leading Taylor coefficients of //expanded 
about sq. To this end, we rewrite the transfer function il.lA) as follows: 

H{s) = B'^isoE - A + (i - so)Ey^B = B^{I + {s ~ so)My^R, (2.33) 

where 

M -.^ {soE ~ Ay^ E and R -.^ {s^E - Ay^B. (2.34) 

In view of (2.33), the Taylor expansion of H about sq is given by 

H{s)^y{~iyB^M'R {s-soy. (2.35) 






The leading Taylor coefficients of H can thus be obtained by computing inner 
products of the columns of the matrix B and the leading columns of the block 
Krylov matrix 

[r MR M^R ■■■ M'-^R ■■■]. (2.36) 

Let A'niax( <^) denote the rank of this matrix. Then for m = 1,2, . . .,A'max, the nth 
block Krylov subspace (induced by M and R) is defined as the ^-dimensional 
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subspace of C'*' spanned by the first « linearly independent columns of the block 
JCrylov matrix (2.36). In the following, /C,-,(M, /?) denotes this nth block Krylov 
subspace. Note that, by construction, /C,-,(M, /?) contains the necessary information 
to generate at least the first 



(2.37) 



Taylor coefficients of the expansion (2.35) of H about ^o- In the generic case, the 
integer (2.37) is the exact number of Taylor coefficients that can be obtained from 
JCf,{M,R). However, in certain degenerate cases, one can obtain even more 
coefficients; we refer the reader to [16, 17] for a complete characterization of the 
exact number of coefficients. 

We remark that for actual numerical computations, this definition of ICfi{M,R) 
is useless. Instead, one employs a suitable Krylov subspace algorithm to generate a 
numerically well-behaved basis of K.f,{M,R). One such algorithm is the band 
Arnoldi process described in [14]. It produces orthonormal basis vectors for the 
subspaces ICf,{M,R). We remark that Krylov subspace algorithms involve the 
matrix M only in the form of matrix-vector products Mv. For the computation of 
these, the matrix M never needs to be formed explicitly. Instead, in view of the 
definition of M in (2.34), each matrix-vector product Mv can be obtained via one 
sparse multiplication with E and two sparse triangular solves with the LU factors 
of the matrix sqE — A. 

Finally, we stress that for general sq g C, ICf,{M,R) is a subspace of C'*'. If we 
restrict sq to be real, then the matrices M and R are real and ICa {M, R) is a subspace 
ofR'^. 



2.3.4 Krylov Subspace-Based Projection 

We now employ the block Krylov subspaces defined in Sect. 2.3.3 to choose 
suitable projection matrices V„. 

Recall from (2.30) that the matrices V„ are assumed to be real. For the sake of 
generating reduced-order models, this assumption is not essential, and in fact, 
complex N X n matrices can be used as well. However, in order to obtain passive 
models, the matrices V„ need to be real. To guarantee this condition, from now on 
we assume that 

so e M, (2.38) 

so that ICf,{M,R) is a subspace of R^. At the end of this subsection, we include 
some remarks about how to proceed in the case of truly complex .sq G C \ R. 

For Krylov subspace-based projection, we choose the matrix (2.30), y„, such 
that 
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Kh{M,R) C colspany„. (2.39) 

Since, by construction, ICf,{M,R) has dimension « and, by (2.30), V„ is assumed to 
have rank «, the condition (2.39) implies that 

h<n. (2.40) 

An obvious choice for V„ is a matrix whose columns form a basis of /C„(M, /?), 
such as the vectors generated by the band Arnoldi process. In this case, we have 
M = « in (2.40). 

Let A„, E„, and B„ be the matrices (2.31), and let H„ be the transfer function 
(2.28) of the corresponding reduced-order model (2.26). The main result of Krylov 
subspace-based projection then states that H„ is a Pade-type approximation of the 
original transfer function (2.24), H, in the following sense: 



H„{s) ^ H{s) + oUs - soY'^"''] , where j{h) > 



(2.41) 



Moreover, in the generic case, ^(m) — [w/mj in (2.41). The property (2.41) is well 
known. For example, it was established for various special cases in [5, 21, 24]. 
Proofs for the general case be found in [13, 16]. We remark that the matrix- valued 
coefficients of the Taylor expansion of the transfer function (2.24), H, about the 
expansion point sq are often called moments and that the Pade-type approximation 
(2.41) is also referred to as moment matching. 

If instead of (2.38), the expansion point ^o is chosen to be truly complex, then 
JCii{M,R) is a complex subspace and the projection matrix V„ in (2.39) will be 
complex as well in general. One possibility of keeping the projection matrix real is 
to replace the complex matrix V„ satisfying (2.39) by the real matrix 

[Rey„ Imy„]. (2.42) 

The obvious disadvantage of this approach is that the dimension of the reduced- 
order model is doubled. Furthermore, in general, the matrix (2.42) is not guar- 
anteed to have full column rank, and so before using (2.42) as a projection matrix, 
one would need to check for and possibly delete any linearly dependent columns of 
(2.42). On the other hand, the transfer function of the resulting reduced-order 
model will satisfy a Pade-type property of the form (2.41) for both sq and the 
complex conjugate expansion point sq\ see, e.g., [22]. 



2.3.5 Structure Preservation 

Recall from (2.19) and (2.17) that the matrices A, E, and B exhibit certain block 
structures reflecting the fact that the descriptor system (2.20) describes an RCL 



2 The SPRIM Algorithm for Stmcture-Preserving Order Reduction 39 

circuit. As long as the expansion point sq is chosen to be real, the reduced-order 
model generated via projection preserves the passivity of the original RCL circuit, 
but not these block structures of the data matrices. In fact, in general V„ will be a 
dense matrix, and then the data matrices (2.31) of the reduced-order model will be 
dense matrices as well. 

For example, consider the 'minimal' choice of the matrix y„ in (2.39), i.e., 
« = n and V„ is chosen as any matrix whose columns span lCh{M,R). The resulting 
order reduction method is mathematically equivalent to PRIMA [24-26]. How- 
ever, in general the data matrices of the PRIMA reduced-order model are dense 
and do not preserve the block structures of the original matrices A, E, and B. In the 
next section, we describe how these structures can indeed be preserved by taking 
advantage of the fact that in (2.39) we can choose a matrix V„ with n > h. 

The reader may ask why preservation of the block structures of the matrices A, 
E, and B is important. There are two reasons: 

1. In practice, one would like to synthesize the system described by the reduced- 
order model (2.26) as an actual physical electronic circuit. It is well known that 
passivity of the reduced-order model is sufficient to guarantee the existence of 
such a synthesized circuit; see, e.g., [1]. However, passivity alone is not enough 
to guarantee synthesis as an RCL circuit, and in general, some other circuit 
elements are needed. Additional properties, such as reciprocity, of the reduced- 
order model are necessary to ensure synthesis as an RCL circuit. While pre- 
serving the block structures of the data matrices of the original descriptor 
system alone is not enough to guarantee synthesis as an RCL circuit, it ensures 
reciprocity and significantly increases the probability that the reduced-order 
model can be synthesized as an actual RCL circuit. 

2. Preserving the block structure also doubles the number of Taylor coefficients 
that are matched in the Pade-type approximation property of the reduced-order 
model. More precisely, the structure-preserving SPRIM algorithm matches 
twice as many coefficients as the non-structure-preserving PRIMA algorithm; 
see Eq. 2.47 in Sect. 2.4.3 below. 



2.4 The SPRIM Algorithm 

The SPRIM (Structure-Preserving Reduced-order Interconnect Mac romode ling) 
algorithm was originally introduced in [15] for the special case of RCL circuits 
with only current sources, but no voltage sources. In this case, the third block row 
and column of A and E in (2.19) and the third block row of E in (2.17) are non- 
existent. This significantly simplifies both the implementation of SPRIM and the 
derivation of certain theoretical properties of SPRIM. In this section, we describe 
the SPRIM algorithm for the case of general RCL circuits with both voltage and 
current sources. 
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2.4.1 Preserving the Block Structures 

The main computational step of SPRIM is the generation of a suitable basis for the 
nth block Krylov subspace /C,^(M,/?). This step is identical to what is done in 
PRIMA. Let V;, be the resulting matrix, the columns of which form a basis of 
ICf,{M,R). PRIMA employs this matrix as the projection matrix to obtain the 
reduced-order data matrices (2.31). As pointed out before, in general, these 
PRIMA data matrices are dense and thus do not preserve the block structures of the 
original matrices A, E, and B. 

Instead of using the matrix Vf, directly for the projection, SPRIM employs a 
modified version of this matrix that trivially leads to structure preservation. To this 
end, y„ is first partitioned as follows: 



V,= 



y(i) 
y(2) 
y(3) 



(2.43) 



Here, the block sizes correspond to the block sizes of A and E in (2.19). While Va 
has full column rank h, the same is not necessarily true for the three subblocks 
V\ / = 1, 2, 3, in (2.43). In particular, the third block, V^^^ is of size n,. x h, 
where n,, denotes the number of voltage sources of the given RCL circuit. The 
number n^ is very small and usually «,, < h. Therefore, V* ' typically does not have 
full column rank. In the actual implementation of SPRIM, we run a Gram-Schmidt 
algorithm on the rows of y*^* to determine a matrix V'"'' the columns of which 
span (instead of spans) the same space as the columns of V^^\ but has full column 
rank. The other two blocks usually have many more rows than columns, and these 
blocks are unlikely not to have full column rank. In the actual implementation of 
SPRIM, there is the option to check the column ranks of the first two blocks and 
replace them by matrices V''' and V*^' of full columns rank. Next, we set up the 
actual projection matrix V„ as follows: 



Vn 



y('' 
y(2) 

W'> 



(2.44) 



By construction, we have 



JCf,{M,R) = colspanV,-, C colspan V„. 



(2.45) 



Thus the matrix (2.44), V,„ satisfies the condition (2.39), which in turn guarantees 
the Pade-type approximation property (2.41). Furthermore, in view of the block 
structure of V„, the data matrices (2.31) of the resulting reduced-order model 
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obtained via projection with V,, have the same block structure as the original data 
matrices A, E, and B. 



2.4.2 The Algorithm 



The order reduction procedure outlined in the previous subsection is the SPRIM 
algorithm for general RCL circuits. SPRIM can be formulated as an actual algo- 
rithm in the following form. 

Algorithm 2 (SPRIM for general RCL circuits) 

• Input: matrices of the form 



A = 


"All -Ai -A,,' 
A] 
Al 


, E- 


where An < 0,En t 0, and L ^ 0; 
an expansion point sq e K. 


• Formally 


set 





?ii 


0" 




"A 


" 





L 


, B = 



















-/ 



M={sqE-AY^E, R^{sqE-A)^B. 

• Until « is large enough, run your favorite block Krylov subspace method 
(applied to M and R) to construct the columns of the basis matrix 

of the Mth block Krylov subspace JCfi{M,R), i.e., 
colspan Vf, = JCh{M,R). 

• Let 

Vh = 



y(i) 
y(2) 
y(3) 



be the partitioning of V^ corresponding to the block sizes of A and E. 

• For 1 = I, 2, 3 do: 

If ri := rank V''^ <«, determine an A' x r/ matrix V''' with 

colspan V^'' = colspan V^'' and rank v"' = r/. 



• Set 



All 
En 



(y(i))^Aiiy(i), A,^{v^'^yA,WK A=(v<")U^v<'\ 
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Output: the data matrices 





A 


11 -Al -A 




'En 





An = 


A] 


, En = 





I 




Al 




. 







'Ai 








Bn = 




. -(y(3')^ 


; 






of the SPRIM red 


uced-order mode 


1 







and 



En—x{t) =A„i{t)+B„u{t), 
at 

~y{t)=Bl~x{t). 



(2.46) 



2.4.3 The Pade-Type Approximation Property of SPRIM 

Recall that SPRIM is a Krylov subspace-based projection method. In particular, 
the matrix V,, used in the projection satisfies the condition (2.39), which implies 
the Pade-type approximation property (2.41). It turns out that SPRIM actually has 
a stronger Pade-type approximation property. More precisely, the transfer function 
H„ of the SPRIM reduced-order model (2.46) satisfies 



H„{s) = H{s) + o(^{s - ^o)^-'''"') , where ;(«) > 



(2.47) 



instead of (2.41). The integer J(m) is the same as in (2.41), and in the generic case, 
j{h) = \h/m\ in (2.47). The property means that at the expansion point sq, the 
transfer function of the SPRIM reduced-order model matches twice as many 
leading Taylor coefficients as the theory of general Krylov subspace-based pro- 
jection methods predicts. 

This higher accuracy of SPRIM is a consequence of structure preservation. 
Projection methods that do not preserve the structures of the original data matrices, 
such as PRIMA, do not satisfy the more accurate Pade-type property (2.47). 

The proof of (2.47) is relatively straightforward for the special case of RCL 
circuits without voltage sources; see [15]. In this case, the third block row and 
column of the matrices A and E and the third block row of the matrix B are empty, 
and one can easily make both matrices A and E symmetric by changing the sign of 
the second block row, without changing B. This symmetry, along with the cor- 
responding symmetry of the data matrices of the SPRIM reduced-order models, 
can be used to establish (2.47). In the case of general RCL circuits, it is no longer 
possible to make A and E symmetric without changing B, and a different approach 



/ 





" 





-/ 











-/ 
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to proving (2.47) is required. The key observation here is that the matrices A and 
E are J-symmetric, i.e., they satisfy 

JA = A^J and JE = E'^J, 

with respect to the indefinite matrix 



/: 



where the /'s denote identity matrices of the same size as the diagonal blocks of 
A and E. Since the matrices A„ and E„ of the SPRIM reduced-order model have the 
same block structure as A and E, respectively, the matrices A„ and E„ are 
/„-symmetric with respect to a 'reduced' version J„ of J. Finally, the projection 
matrix (2.44), V„, employed in SPRIM is compatible with / and /„ in the sense that 

JV„ = V„J„. (2.48) 

In [16], we developed a general theory of Krylov subspace-based projection of 
/-symmetric descriptor systems. More precisely, we showed that the stronger 
Pade-type approximation property (2.47) holds true, provided that the data 
matrices of the reduced-order models are /^-symmetric and the compatibility 
condition (2.48) is satisfied. By applying this more general result [16, Theorem 9] 
to SPRIM, we obtain its stronger Pade-type approximation property (2.47). 



2.5 Treatment of Voltage Sources 

Recall from (2.19) that the third block row and column of the matrices A and 
E arise due to the presence of voltage sources in the given RCL circuit. Note that 
the size of the third block rows is n^ x N and the size of the third block columns is 
N X «,,. Here, «,, denotes the number of voltage sources, which is usually very 
small. In SPRIM, the corresponding third block y*^* of the matrix (2.43), V,-, is 
usually rank deficient and thus needs to be replaced by a block V'^' of full column 
rank, see Algorithm 2. 

In many cases, it is actually possible to treat all or at least some of the voltage 
sources separately and to apply the model reduction algorithm itself to a slightly 
smaller RCL circuit. The reason is that voltage sources are usually connected to 
the ground node of the circuit and they are connected to the remaining RCL circuit 
by a resistor. Such a case is illustrated in Fig. 2.1, which shows an RCL circuit 
with 4 such voltage sources connected to the ground node and by a resistor to the 
remaining RCL network. Recall from (2.16) that the given voltages of the voltage 
sources are part of the input vector u{t) of the descriptor system (2.20) and from 
(2.16) that the unknown currents through the voltage sources and the voltages at 
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Fig. 2.1 An RCL circuit for 
wtiich 4 voltage sources can 
be eliminated before model 
reduction is applied 




Fig. 2.2 A typical voltage 
source connected to the 
ground node and by a single 
resistor to the remaining RCL 
network 



Node! • VjCt) 



R 



Nodel W Vi(t) 



Kit) 




i(t) 



the circuit nodes are part of the unknown state-vector x(t) of (2.20). For voltage 
sources connected to the remaining RCL network as in Fig. 2.1, there are some 
trivial relations between some of these circuit quantities, which can be used to 
reduce the state-space dimension of (2.20). More precisely, consider a typical 
voltage source connected to ground and to a resistor, as shown in Fig. 2.2. In this 
case, the voltage Vi(?) at node 1 is equal to the voltage Vy{t) provided by the voltage 



Vl(f) =Vy{t). 



(2.49) 



By Kirchhoff's current law, the unknown current i^.(t) through the voltage source is 
equal to the unknown current /,-(?) through the resistor. Together with Ohm's law 
for resistors, it follows that 



h{f)^h{t)=-{v2{t)~Vi{t)), 



(2.50) 
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where V2(f) denotes the unknown vohage at node 2 and R is the given resistance of 
the resistor. Using the two relations (2.49) and (2.50), we can eliminate the 
unknowns vi(?) and iy(t) from the circuit equation, thus reducing the state-space 
dimension by 2. 

The above procedure can be carried out for all voltage sources that are con- 
nected to the ground node and by a single resistor to the remaining RCL network. 
If njf ' denotes the number of such voltage sources in the given RCL circuit, then 

the state-space dimension of the resulting descriptor system is A^'''^ -.^ N ~ Inlf . 
Here, A^ is the state-space dimension of the original descriptor system (2.20). 

It is relatively straightforward to carry out this elimination on the matrices of 
the original system (2.20). Here, we omit the full details and just state the final 
result. The given RCL circuit is again described by a descriptor system that now 
has the following form: 



dt ^ ' ^ ' ^ ' 

3;(f)=Z)M(r)+(cW)V)(f). 



(2.51) 



Here, A*'^' and Zs*'^ are A^'''^ x A^''' matrices corresponding to the remaining RCL 
network. These matrices have the same block structure as the matrices in (2.20), 
but with the sizes of each first and third block row and column reduced by iiy. In 
particular, in the case that all voltage sources have been eliminated, then A**^^ and 
£'^'^' have no third block rows and columns at all. The state-vector jt'^'^f) of (2.51) is 
obtained from the state-vector x{t) of (2.20) by deleting the voltages at the nodes 
between the eliminated voltage sources and the directly connected resistors and the 
currents through the eliminated voltage sources. The input and output vectors of 
(2.51) are the same as in (2.16). We now partition these vectors as follows: 



u{t) 



\-um 


vif'(0 


u\t)\ 



and y{t) 






(2.52) 



Here, the superscripts "(e)" and "('")" refer to eliminated and remaining volt- 
ages sources, respectively. Finally, in (2.51), D e """""" ' "''''' '"'''^ - ^ 
are matrices of the following form: 
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(2.53) 
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Here, R^ denotes the diagonal matrix the entries of which are the resistances of the 
resistors connected directly to the eliminated voltage sources, and the partitions in 
(2.53) are conforming with the partitions of the input and output vectors (2.52). 
To obtain a reduced-order model of the descriptor system (2.51), we can again 
employ SPRIM. The data matrices of the reduced system are obtained analogous 
to (2.31), together with the additional relations 

C„ := yJC<'' and D„ := D. 

It is easy to see that SPRIM applied to the descriptor system (2.51) preserves the 
structures of all the data matrices. 



2.6 Numerical Examples 

In this section, we present some results for two classes of circuit examples. In all 
cases, we ran the SPRIM algorithm for increasing values of the dimension h of the 
underlying block Krylov subspaces and stopped when the transfer function of the 
SPRIM reduced-order model had converged to the transfer function of the unre- 
duced original descriptor system. Here, convergence is monitored over a relevant 
frequency range of interest of the form 



where i = v — 1- For the value of h at convergence of SPRIM, we also generated 
the corresponding PRIMA reduced-order model produced from the same 
«-dimensional block Krylov subspace. This is a fair comparison since generating 
basis vectors for this subspace is the dominating computational cost for both 
SPRIM and PRIMA. In all cases, we plot the absolute values of H{s) and H„(s) (for 
both SPRIM and PRIMA) over the frequency range of interest. 

The first example is a variant of the so-called PEEC circuit [29]. It only has 
state-space dimension A^ — 308, but due to its many poles and zeros close to the 
frequency range of interest, its transfer function has many features. This variant of 
the PEEC circuit has two current sources and one voltage source, and thus m = 3. 
The expansion point sq — n x 10'° was used. In this case, the Krylov dimension 
M = 90 is needed to achieve convergence. Figure 2.3 depicts the absolute values of 
the (l,l)-component of the 3 x 3-matrix-valued transfer functions. Clearly, for 
h = 90 PRIMA has not converged yet. Figure 2.4 shows a close-up of the sub- 
range where the PRIMA and SPRIM reduced-order models differ the most. Fig- 
ures 2.5 and 2.6 display the corresponding plots for the (l,3)-component of the 
3 X 3-matrix-valued transfer functions. 

The second example (referred to as "package example") is a larger RCL circuit 
with state-space dimension A^ = 1841. This circuit has 8 current sources and 8 
voltage sources, and thus m — 16. Its transfer function is 16 x 16-matrix- valued 
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Fig. 2.3 PEEC example, 
(l,l)-component of transfer 
functions 
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Fig. 2.4 PEEC example, 
close-up of (l,l)-component 
of transfer functions 
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and has 256 components. The expansion point io = 27i x 10 was used. For this 
example, the Krylov dimension n = 128 is needed to achieve convergence. Fig- 
ures 2.7 and 2.8 depict the absolute values of the (8,l)-component and the (9,9)- 
component of the transfer functions. Note that for « = 128 PRIMA has not con- 
verged yet. 

The 8 voltage sources of the package example are all of the type shown in 
Fig. 2.2, and so all voltage sources can be eliminated using the approach outlined 
in Sect. 2.5. We have applied SPRIM and PRIMA to the resulting descriptor 
system (2.51) of state-space dimension A'*'' = 1841 — 16 — 1825. As before, the 
Krylov dimension h — 128 is needed to achieve convergence. Figure 2.9 shows 
the absolute values of the (16,9)-component of the transfer functions. 
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Fig. 2.5 PEEC example, 
(l,3)-component of transfer 
functions 
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Fig. 2.6 PEEC example, 
close-up of the (1,3)- 
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functions 
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2.7 Concluding Remarks 



In this paper, we reviewed the formulation of general RCL circuits as descriptor 
systems and described the SPRIM reduction algorithm for general RCL circuits. 
While there has been a lot of progress in Krylov subspace-based order reduction 
of large-scale RCL circuits in recent years, there are still many open problems. All 
state-of-the-art structure-preserving methods, such as SPRIM, first generate a basis 
matrix of the underlying Krylov subspace and then employ explicit projection 
using some suitable partitioning of the basis matrix to obtain a structure-preserving 
reduced-order model. In particular, there are two major problems with the use of 
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Fig. 2.8 Package example, 
(9,9)-component of transfer 
functions 
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such explicit projections. First, it requires the storage of the basis matrix, which 
becomes prohibitive in the case of truly large-scale linear dynamical systems. 
Second, the approximation properties of the resulting structure-preserving 
reduced-order models are not optimal, and they show that the available degrees of 
freedom are not fully used in general. It would be highly desirable to have 
structure-preserving reduction methods that do not involve explicit projection and 
would thus be applicable in the truly large-scale case. Other unresolved issues 
include the automatic and adaptive choice of suitable expansion points sq and 
robust and reliable stopping criteria and error bounds. 
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Fig. 2.9 Package example 
with voltage sources 
eliminated, (16,9)-component 
of transfer functions 
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Chapter 3 

Balancing-Related Model Reduction 

of Circuit Equations Using Topological 

Structure 



Tatjana Stykel 



Abstract In recent years, model order reduction has been recognized to be a 
powerful tool in analysis and simulation of integrated circuits. We consider 
balancing-related model reduction methods for differential-algebraic equations 
arising in circuit simulation. We show how positive real and bounded real bal- 
anced truncation can be used for passivity-preserving model reduction of circuit 
equations. These methods are based on balancing the solutions of projected Lur'e 
or Riccati matrix equations and admit computable error bounds. We also discuss 
efficient algorithms for solving such matrix equations that exploit the topological 
structure of circuit equations. Numerical experiments demonstrate the perfor- 
mance of the presented model reduction methods. 

Keywords Model reduction • Balanced truncation • Circiut equations • Passivity • 
Matrix equations 



3.1 Introduction 

Modern integrated circuits have hundreds of millions of semiconductor devices 
whose feature size is nowadays reaching the nanometer range. These devices are 
placed on several layers and interconnected to each other by wires. Due to 
increased packing density and interconnect length, modelling of thermal and 
electromagnetic effects is highly required in order to verify that the heat con- 
duction and internal electromagnetic field do not disturb signal propagation. 
Design of VLSI circuits with distributed elements is no longer possible without 
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computer simulations that involve numerical solution of coupled systems of partial 
differential equations and differential-algebraic equations (DAEs). After spatial 
discretization, such systems have very large state space dimension that makes the 
analysis and simulations unacceptably time consuming and expensive. In this 
context, model order reduction is of great importance. 

A general idea of model order reduction is to approximate the large-scale system 
by a much smaller model that captures the input-output behavior of the original 
system to a required accuracy and also preserves essential physical properties such 
as stability and passivity. Many different model reduction approaches have been 
developed in computational fluid dynamics, control design and electrical and 
mechanical engineering, see [2, 15, 70] for recent books on this topic. One of the 
most used model reduction techniques in circuit simulation is moment matching 
approximation based on Krylov subspace methods, e.g., [4, 30, 38]. Although these 
methods are efficient for very large sparse problems, the resulting reduced-order 
systems have only locally good approximation properties. Another drawback of 
the moment matching methods is that stability and passivity are not necessarily 
preserved in the reduced-order models, so that usually post-processing is needed 
to realize these properties. Recently, passivity-preserving model reduction 
methods based on Krylov subspaces have been developed for structured systems 
arising in circuit simulation [31, 33, 45, 56] and also for general systems [3, 28, 
41, 72]. However, none of these methods provides computable global error 
bounds. 

Balanced truncation is another model reduction approach commonly used in 
control design. In order to capture specific system properties, different balancing 
techniques have been developed in the last thirty years for standard state space 
systems [26, 39, 53, 55, 60, 77] and also for DAEs [7, 14, 59, 63, 74]. In particular, 
passivity-preserving balanced truncation has been considered in [9, 13, 60, 63-65, 83]. 
An important property of balancing-related model reduction methods is the exis- 
tence of computable error bounds. Unfortunately, these methods have a reputation 
for being very expensive since they involve solving (projected) Lyapunov and/or 
Riccati matrix equations. However, recent developments on iterative methods for 
such equations [16, 49, 57, 71, 75] show that balanced truncation methods can also 
be applied to large-scale problems. 

In this paper, we give a brief survey on model reduction of circuit equations 
using balanced truncation and its relatives. In Sect. 3.2, we present some basic 
foundations from graph theory and network analysis required in the following. 
In Sect. 3.3, the balanced truncation model reduction approach for DAEs is 
described. Passivity-preserving model reduction methods for circuit equations 
based on positive real and bounded real balanced truncation are also considered. In 
Sect. 3.4, we discuss numerical solution of projected Lyapunov and Riccati 
equations with large-scale matrix coefficients. Section 3.5 contains some results of 
numerical experiments demonstrating the efficiency of the balancing-related 
model reduction techniques. 

Throughout the paper, R""' and C"'™ denote the spaces of n x m real and 
complex matrices, respectively. The open left and right half -planes are denoted by 
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C_ and C+, respectively, and / = V^. The matrices A^ and A denote, respec- 
tively, the transpose and the conjugate transpose of A e C"'™, and A^^ = (A"') . 
An identity matrix of order n is denoted by /„ or simply by /. We use rank(A) and 
ker{A) for the rank and the kernel of A, respectively. A matrix A e C"" is positive 
definite {semidefinite), if v Av > (v*Av > 0) for all non-zero v e C". Note that 
positive (semi)definiteness of A does not require A to be Hermitian. For 
A,B e C"'", we write A > B (A > 5) if A - 5 is positive definite (semidefinite). 



3.2 Circuit Equations 

In this section, we briefly describe the formulation of linear RLC circuits via DAEs 
and discuss their properties. For more details on graph theory and network anal- 
ysis, we refer to [1, 22, 44, 79]. 

A general electrical circuit can be modelled as a directed graph © — (91, 95) 
whose vertices n^ e 91 correspond to the nodes of the circuit and whose edges 
(branches) bj, jtj = (n^, ,nt,) e 23 correspond to the circuit elements like capacitors, 
inductors and resistors. For the ordered pair b^j jt, = (nj^j,n^,), we say that hi^^j^^ 
leaves n^., and enters n^,. An alternating sequence (n^ , b^j , n^, , . . ., n^_ , , b^;.^ j , n^^J 
of vertices and edges in © is called a. path connecting n^^, and n^^ if b^^ = (n^^, n,i,^,) 
and tt^^ 7^ n^ for 2 < ( <j <s. A path is closed if nj., and n^^ are the same, and open if 
they are different. A closed path is called a loop. A graph © is called connected if for 
every two vertices there exists an open path connecting them. A cutset is a set of 
edges of a connected graph whose removal disconnects the graph, and this set is 
minimal with this property. A subgraph of the graph © is called a tree if it has all 
nodes of ©, is connected and does not contain loops. 

Any directed graph © = (91,©) with Si = {rti, . . .,n„,^+i} and S = {bi,...,b„J 
can be described by an incidence matrix Aq = [au] € M"''+^'"'' defined as 

{1 if edge b/ leaves vertex rt/t, 
— 1 if edge b/ enters vertex n^, 
otherwise. 

In a connected graph, any n,, rows of Aq are linearly independent. Thus, deleting 
any row from Aq yields a full rank matrix A € R"''"* known as reduced incidence 
matrix. For circuits, the deleted row corresponds to a reference (or grounding) 
node. 

We now consider a general linear RLC circuit that contains linear resistors, 
inductors, capacitors, independent voltage sources and independent current sour- 
ces only. Such circuits are often used to model the interconnects, transmission 
lines and pin packages in VLSI networks. They arise also in the linearization of 
nonlinear circuit equations around DC operating points. RLC circuits are com- 
pletely described by the graph-theoretic relations like Kirchhoff's current and 
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voltage laws together with the branch constitutive relations that characterize the 
circuit elements. Kirchhojf's current law states that the sum of the currents along 
all edges leaving and entering any circuit node is zero. Kirchhojf's voltage law 
states that the sum of the voltages along the branches of any loop is zero. Let 



J 



UrJc^Jl'JvJi 



T]T 



and V 



IvLv^ 



'R:VC'Vl,Vy,V'j 



T]T 



denote the vectors 



of branch currents and branch voltages, respectively, and let the reduced incidence 
matrix A = [Ar,Ac,Ai,Av,Ai] be partitioned accordingly, where the subscripts 
R, C, L, V and / stand for resistors, capacitors, inductors, voltage sources and 
current sources, respectively. Then Kirchhoff's current and voltage laws can be 
expressed in the compact form as 



A; - 0, 



PJ}y- 



V, 



(3.1) 



respectively, where rf e M"'' denotes the vector of potentials of all nodes excepting 
the reference node. 

The branch constitutive relations for the linear capacitors, inductors and 
resistors are given by 



C-vc{t)=jc{tl 



VL{t) 



Cj^Jdtl Mt) 



njR{t), 



(3.2) 



where C G M"^'"^, £ e W-"'- and TZ e W"-"" are the capacitance, inductance and 
resistance matrices, respectively. These matrices are often diagonal and their 
diagonal entries are the capacitances, inductances and resistances of the capacitors, 
resistors and inductors, respectively. However, the diagonal structure gets lost in 
case of mutually coupled elements. IfTZ and C are nonsingular, then Q = 7?."' and 
S = L^ are the conductance and susceptance matrices, respectively. 

Using relations (3.1) and (3.2), the behaviour of a linear RLC circuit can be 
described via modified nodal analysis (MNA) [79] by the following system of 
DAEs 



where 
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The number of state variables « = «,,+«£ + ny is called the order of system (3.3), 
and m — rij + ny is the number of inputs and outputs. In the following we will 
assume that the circuit is well-posed in the sense that it has neither V-loops nor 
I-cutsets. These assumptions can be written in terms of the incidence matrices as 
follows: 

(Al) The matrix Ay has full column rank, i.e., rank(Av) = ny. 

(A2) The matrix AcLRv = [Ac,A£, Ar, Ay] has fullrowrank, i.e., rank(Ac£Ry) = «,,. 

We will also assume that 

(A3) C, Q and C are positive definite. 

Assumptions (A1)-(A3) together guarantee that the matrix pencil XE — A is 
regular, i.e., det(/l£ — A) ^ Q for some X e C, see [32]. In this case, we can 
define a transfer matrix G{s) — C{sE — A)~^B + D that describes the input- 
output relation of (3.3) in the frequency domain. The transfer function G is called 
proper if lim^^oo G{s) <oo, and improper, otherwise. If limj^oc G{s) — 0, then 
G is called strictly proper. 

Any regular pencil XE — A can be reduced into the Weierstrass canonical form 



Ti 



I„f 
£^, 



Tn A^T, 



Af 

/„. 



r„ (3.5) 



where 7) and T,. are the left and right nonsingular transformation matrices, and £33 
is nilpotent with index of nilpotency p, see [34]. The eigenvalues of Af are the 
finite eigenvalues of XE — A, and Err, corresponds to an eigenvalue at infinity. The 
number /^i is called the index of XE — A and also of the DAE system (3.3). Index 
concept plays an important role in the analysis and numerical solution of DAEs, 
e.g., [19, 20, 37, 46, 67]. The following proposition characterizes the index of the 
MNA equations (3.3), (3.4). 

Proposition 1 [27] Let E and A be as in (3.4) and let (A1)-(A3) be fulfilled. 

1 . The index of the pencil XE — A is at most two. 

2. The index of XE — A is equal to zero if and only if 

ny ~ 0, rank(Ac) = «,,. (3.6) 

3. The index of XE — A is equal to one if and only if 

rank(gcAv) = Mv, rank[Ac,AR,Av] = «,, (3.7) 

where Qc is a projector onto ker{Ac). 

Considering the topological structure of the circuit, the conditions (3.6) imply 
that the circuit does not contain voltage sources and the circuit graph contains a 
capacitive tree. Furthermore, the first condition in (3.7) implies that the circuit 
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does not contain CV-loops except for C-loops, whereas the second condition in 
(3.7) means that the circuit does not contain Ll-cutsets. 

Using the Weierstrass canonical form (3.5), the MNA system (3.3), (3.4) can be 
decoupled into the slow subsystem 

ii(r) ^AfXi{t)+Bfu{t), (3.8a) 

yi{t)^CfXi{t), (3.8b) 

and the fast subsystem 

EooXiit) = X2{t)+B^u{t), (3.9a) 

y2{t) = CocX2(r), (3.9b) 

where Trx{t) = [{xi{t)f, {x2{t))Y ,y{t) = yi{t)+y2{t) and 



rr'B = 



Bf 
B^ 



, cr-'==[C/,cj. (3.10) 

Equation 3.8a with the initial condition xi(0) = a'[ has a unique solution 

t 
XI (f) = e'^fx° + f e^'-'^^' Bfu{x)dr 





for any integrable input u and any initial vector x" e M"^ Since the index fi of 
system (3.3), (3.4) does not exceed two, a unique solution of Eq. (3.9a) is given by 

X2{t) = -BaoU{t) - E^BooU{t). 

This representation shows that for the existence of a continuously differentiable 
solution X of (3.3), (3.4), it is necessary that the input function u is fi times 
continuously differentiable. Moreover, the initial condition x(0) = xq has to be 
consistent, i.e., xq = T^^[{x^^) , (.Xj) ] must satisfy 

xl^-B^u{0)-E^B^u{0). 

If the initial vector xq is inconsistent or the input u is not sufficiently smooth, then 
the solution of the MNA system (3.3), (3.4) may have impulsive modes [20]. 



3.2.1 Stability 

Stability is a qualitative property of dynamical systems which describes the 
behaviour of their solutions under small perturbations in the initial data. For the 
linear time-invariant DAE system (3.3), stability can be characterized in terms of 
the finite eigenvalues of the pencil IZs — A, e.g., [24]. System (3.3) is stable if all 
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the finite eigenvalues of AZs — A lie in the closed left half-plane and the eigen- 
values on the imaginary axis are semi-simple, i.e., they have the same algebraic 
and geometric multiplicity. System (3.3) is asymptotically stable if the pencil 
aE — Ais c-stable, i.e., all its finite eigenvalues lie in the open left half-plane. The 
following proposition gives the topological conditions for the asymptotic stability 
of the MNA equations (3.3), (3.4). 

Proposition 2 [66] Let the matrices E and A be as in (3.4) and let (A1)-(A3) be 
fulfilled. Assume that C and C are symmetric and one of the following two pairs of 
topological conditions holds: 

1. rank[AL, Ay] == ni + ny, rank[Afi,Av] = «,, (3-11) 

2. rank[Ac,A£, Ay] == «c + «l + "v, i"ank[A/,, Ar,Ai/] = «,,. (3.12) 
Then the MNA system (3.3), (3.4) is asymptotically stable. 

Conditions (3.11) are equivalent to the absence of LV-loops and CLI-cutsets 
(except maybe for Ll-cutsets), whereas (3.12) implies that the circuit does not 
contain CLV-loops (except maybe for CV-loops) and Cl-cutsets. 

If system (3.3) is asymptotically stable, then the Hoc-norm of its transfer 
function G is defined as ||G||jj = sup^^gjj ||G(((d)||, where || • || denotes the 
spectral matrix norm. 



3.2.2 Passivity and Positive Realness 

Passivity is a most basic property of circuit equations. Generally speaking, pas- 
sivity means that the system does not produce energy. More precisely, system (3.3) 
is passive if 



I 



u{xfy{x)dT>0 (3.13) 



T 



for all f > and all admissible u such that m y is locally integrable. For a circuit 
element with a voltage v and a current 7, condition (3.13) implies that the storage 
energy of this element defined as 



v{zfi{T:)dz 



is always nonnegative. Thus, capacitors, resistors and inductors with nonnegative 
element values are passive. Furthermore, interconnection of a finite number of 
passive circuit components yields a passive network [1]. 
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It is well known in network theory [1] that the DAE system (3.3) is passive if 
and only if its transfer function G(s) — C(sE — A)~'fi + D is positive real, i.e., 
G is analytic in C+ and G{s) + G(s)* > for all s € C+. Using the Weierstrass 
canonical form (3.5) and (3.10), the transfer function of (3.3) can be additively 
decomposed as 

G(^) = Gsp{s) + Mo + 5M, + ■ ■ ■ + ^"-'M^,-i , 

where Gsp{s) = Cf{sl -~ Af)^ Bj is the strictly proper part of G, Mq = D ~ CocBoc 
and Mil = —CooE^Boo for k>l. One can show that G is positive real if and only if 
its proper part Gp{s) — Gsp{s) + Mq is positive real, M[ = Mi > and M<. — for 
A: > 1, see [1]. 

The following proposition gives sufficient conditions for system (3.3), (3.4) to 
be stable and passive. 

Proposition 3 If Assumptions (A1)-(A3) are fulfilled and the matrices C and C 
are symmetric, then the MNA system (3.3), (3.4) is stable and passive. 

Proof The facts that the pencil ?.E — A in (3.4) has no finite eigenvalues in C+ 
and the transfer function G(s) — C(sE — A)~^B of (3.3), (3.4) is positive real have 
been proved in [61, 66], respectively. Analogously, we can show that {sE — A)~^ 
is also positive real. Hence, the purely imaginary eigenvalues of }.E — A are semi- 
simple [1, Theorem 2.7.2]. Thus, the MNA system (3.3), (3.4) is stable and 
passive. D 



3.2.3 Contractivity and Bounded Realness 

An important class of dynamical systems are contractive systems. System (3.3) is 
called contractive if 

\\\u{T)\\'~\W)f)dT>0 (3.14) 



for all f > and all admissible u such that u and y are both square integrable. The 
integral in (3.14) expresses the difference between the input and output energy of 
the system. One can show that (3.3) is contractive if and only if its transfer 
function G is bounded real, i.e., G is analytic in C+ and / — G(i)*G(i) > for all 
s E C+, see [1]. For the asymptotically stable system (3.3), contractivity is 
equivalent to the condition \\G\\-^ < 1 that justifies the name 'contractive'. Note 
that the bounded real transfer function is necessarily proper. 

Positive real and bounded real square transfer functions are related to each other 
via a Moebius transformation defined as 

Ji{G){s) = (/ - G(^))(/ + G(^))"'. 
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The transfer function G is positive real if and only if the Moebius-transformed 
function G{s) = ^/{G){s) is bounded real [1]. For system (3.3) with nonsingular 
I + D, the function G{s) can be represented as G{s) = C{sE — A)" B + D, where 



A-B{I + D) 'C, 



B 



c = V2{i + Dy'c, b = {i~D){i 

For the MNA matrices as in (3.4), we have 

^ArGAI-AiA] 



V2B{I + Dy 



AcCAl 
CO 




Al 



-Al 





~Av 

-I 



B^V2 



'A, 


0" 











/ 



(3.15) 



C 



D 



It has been shown in [64] that under Assumptions (A1)-(A3) the pencil IE ~ Am 
(3.15) is of index at most two. It is equal to one if and only if 
rank[Ac,A^,i4/,Av] = «,,. This condition means that the circuit does not contain 
L-cutsets. 



3.2.4 Reciprocity 

Another relevant property of circuit equations is reciprocity. We call a matrix 
S e W"'"' a signature if S is diagonal and S — /,„. System (3.3) is reciprocal with 
an external signature S^a € K'"'"' if its transfer function satisfies G{s) = 
5extG(.j) 5ext for all i e C. The following proposition shows that the symmetry of 
C, C and Q guarantees the reciprocity of system (3.3), (3.4). 



Proposition 4 [62] Let Assumptions (A1)-(A3) be fulfilled and let the matrices C, 
C and Q be symmetric. Then the MNA system (3.3), (3.4) is reciprocal with the 
external signature Se^t — diag(/„,, —Iny)- 



3.3 Balancing-Related JVIodel Reduction 



The aim of model order reduction for circuit equations is to approximate the DAE 
system (3.3), (3.4) with a reduced-order model 



Ex{t) 

m 



■ Ax{t) 

■ Cx{t) 



Bu{t), 
' bu{t), 



(3.16) 
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where £, A e R^'^ B e R^'"', C G R'" ', D G M'"'"' and £ < n. It is required for the 
approximate system (3.16) to preserve physical properties like stability, passivity 
and reciprocity. Such a system can then be synthesized as an electrical circuit in an 
standard netlist format, e.g., [62, 84] and Chap. 12 of this volume, that is often 
required in the industrial circuit simulators. It is also important to have a small 
approximation error y — y or G — G, where G{s) = C{sE — A) B + D. In the 
ideal case, we would like to have a computable error bound that allows us to 
approximate (3.3) to a given accuracy and makes model reduction fully automatic. 
Most of the model reduction methods for linear dynamical systems are based on 
the projection of the system onto lower dimensional subspaces. In this case, the 
system matrices of the reduced-order model (3.16) have the form 

E^W'^ET, A = W'^AT, B=W^B, C ^ CT, (3.17) 

where the projection matrices W,T £ R"'^ determine the subspaces of interest. For 
example, in modal model reduction the columns of W and T span, respectively, the 
left and right deflating subspaces of the pencil IE — A corresponding to the 
dominant eigenvalues, e.g., [25, 50]. In the moment matching approximation, one 
chooses the projection matrices W and T whose columns form the bases of certain 
Krylov subspaces associated with (3.3), e.g., [4, 30]. 



3.3.1 Balanced Truncation Model Reduction 

Balanced truncation also belongs to the projection-based model reduction tech- 
niques. This method consists in transforming the dynamical system into a balanced 
form whose controllability and observability Gramians are both equal to a diagonal 
matrix. Then a reduced-order model (3.16), (3.17) is obtained by projecting (3.3) 
onto the subspaces corresponding to the dominant diagonal elements of the bal- 
anced Gramians. This idea goes back to [54] and has been extended over the years 
in different directions by many authors, e.g., [11, 14, 26, 35, 39, 52, 53, 55, 58, 60, 
74, 77]. 

For standard state space systems with E — I, the balanced truncation model 
reduction method makes use of the dual Lyapunov equations 

AG, + GcA^ = --BB^, A^G„ + GoA = ~C^C. 

If all eigenvalues of the matrix A have negative real part, then these equations have 
unique symmetric, positive semidefinite solutions G, and Gg known as the con- 
trollability and observability Gramians, respectively. One can show that all 
eigenvalues of the product GcGo are real and nonnegative. The square roots of 
these eigenvalues, denoted by cr,, are called the Hankel singular values of system 
(3.3) with E — I. Such a system is balanced if G, — Go = diag(cri, . . ., Oy^). If the 
system is controllable and observable, then the Gramians G, and Gg are both 
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positive definite. In this case, there exists a balancing state space transformation 
such that the Gramians of the transformed system become equal and diagonal with 
the Hankel singular values on the diagonal. Then the reduced-order model is 
obtained by truncating the states corresponding to the small Hankel singular 
values. Such states are simultaneously difficult to reach and to observe, since they 
have a small impact on the energy transfer from input to output, see [35, 53] for 
details. 

The balanced truncation model reduction approach can be extended to system 
(3.3) with E ^ I. If E is nonsingular, then the Gramians are defined as unique 
symmetric, positive semidefinite solutions of the generalized Lyapunov equations 



AG^E'^ + EGcA^ 



BB' 



A^G„E + E^G„A= -C^C, 



(3.18) 



provided the pencil IE — A is c-stable. However, for singular E, these equations 
cannot be used any more to determine the Gramians for the DAE system (3.3). As 
the following example shows, the generalized Lyapunov equations (3.18) with 
singular E may not have solutions even if aE — A is c-stable. Moreover, if the 
solutions of (3.18) exist, they are always nonunique, see [73] for detailed 
discussions. 

Example 1 Consider the simple RL circuit shown in Fig. 3.1. This circuit is 
described by the DAE system (3.3) with 



E = 









0' 










c 






, A = 








0. 











1 


1 





-l/7^ 


1 





1 


-1 








-1 












B^=[0, 0, 0, -1] = C. 

The pencil }.E — A has only one finite eigenvalue 1 = —TZ/C < 0. However, the 
generalized Lyapunov equations (3.18) are not solvable. 

An extension of the balanced truncation method to DAEs based on projected 
Lyapunov equations has been presented in [52, 74]. Unlike the standard state space 
case, the DAE system (3.3) has two pairs of the Gramians, one pair for the slow 
subsystem (3.8a, b) and the other pair for the fast subsystem (3.9a, b). If (3.3) is 



Fig. 3.1 A simple RL circuit 



Vy 



O 



L 
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asymptotically stable, then the proper controllability and observability Gramians 
Gpc and Gp^ of (3.3) are defined as unique symmetric, positive semidefinite 
solutions of the projected generalized continuous-time Lyapunov equations 

EGpA^ + AGp,E'' = ~P,BB^Pj, Gp, = PrG,„Pj., (3. 19) 

E^GpoA + A^GpoE = -P^C^CPr, Gpo = PjGpoPi, (3.20) 

where Pi and P^ are the spectral projectors onto the left and right deflating sub- 
spaces of the pencil IE — A corresponding to the finite eigenvalues along the left 
and right deflating subspaces corresponding to the eigenvalue at infinity. Using the 
Weierstrass canonical form (3.5), these projectors can be represented as 



Pr 



I 




Tr, Pi = Ti 



I 




Tf'. 



Furthermore, the improper controllability and observability Gramians Gic and Gjo 
of system (3.3) are defined as unique symmetric, positive semidefinite solutions of 
the projected generalized discrete-time Lyapunov equations 

AG,,A^ - EGiaE^ = QiBB^Qj, G,, = QrG,cQj., (3.21) 

A^Gu,A-E^G,oE = QjC^CQ„ G;„ = Q]G;oQu (3.22) 

where Qi = I — Pi and Qr = I ~ P, are the complementary projectors. Note that 
unlike generalized Lyapunov equations considered in [42, 48, 76], the existence 
and uniqueness results for the projected Lyapunov equations (3.19)-(3.22) can be 
stated independently of the index of the pencil IE — A, see [73]. 
Using the proper and improper Gramians, we can define the proper and improper 
Hankel singular values that characterize the importance of state variables in the 
slow and fast subsystems (3.8), and (3.9), respectively. Let «/be the dimension of 
the deflating subspaces of IE — A corresponding to the finite eigenvalues. Then 
the proper Hankel singular values <Jj of system (3.3) are defined as the square roots 
of the largest n f eigenvalues of the matrix GpcE^GpgE, and the improper Hankel 
singular values Oj are defined as the square roots of the largest n^o = n — nf 
eigenvalues of the matrix GicA^GioA. We assume that the proper and improper 
Hankel singular values are ordered decreasingly. System (3.3) is balanced if the 
Gramians satisfy 

Gpc = Gpo = diag(5:, 0) with Z = diag((7i, . . ., (T„J, 
G,-, = G,v, - diag(0, 0) with = diag(Oi,...,O„^). 

States of the balanced system corresponding to the small proper Hankel singular 
values are less involved in the energy transfer from inputs to outputs, and, 
therefore, they can be truncated without changing the system properties 
significantly. Furthermore, we can remove the states of the balanced system 
corresponding to the zero improper Hankel singular values. Such states are 
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uncontrollable and unobservable at infinity and do not influence the input-output 
relation. However, if we truncate the states that correspond to the non-zero 
improper Hankel singular values, even if they are small, then the approximation 
may be inaccurate. These states are subject to constraints, and their elimination 
may lead to undesirable disturbances in the approximate system and physically 
meaningless results. 



Example 2 Consider the DAE system (3.3) with 



"0 


1 


0' 








1 












/3, B 



10 

0.1 





C^ 



0.04 

30 

1 



(3.23) 



Since E is nilpotent, this system has only the improper Hankel singular values 
given by 9^ — 3.4, O2 = 4.7 x lO"*", 63 — 0. The truncation of the state corre- 
sponding to the Hankel singular value 63 = results in the reduced-order model 



1.18 
-1.18 



1.18 
-1.18 



0^^ 


■ 


x{t 


) + 


■ 1.84 X 10 





10^ 






2.25 X 10 



i(f) 



y{t) = [1.84 X 10^-2.25 x lO-^]x{t). 



u{t), 



(3.24) 



Figure 3.2a shows the output functions of the original and the reduced-order 
systems with the input u{t) — sin(f). They coincide since both systems have the 
same transfer function. However, if we truncate one more state corresponding to 
the second Hankel singular value, which is relatively small, we obtain the standard 
state space system 



Jc{t) = 850x(f) + 1567M(f), y{t) = lMx{t). 



(3.25) 



This system is unstable, and, as Fig. 3.2b demonstrates, its output has nothing in 
common with the output of the original system. 

We summarize the balanced truncation model reduction method for DAEs in 
Algorithm 1 . For this method, we have the following a priori error bound. 

\\y - jIIl, < l|G - G||n^ ||m||l, < 2{ai^+i + ■■■ + o,f)\\u\\j^^_ 

that allows an adaptive choice of the order of the approximate model. Furthermore, 
the resulting reduced-order system (3.16) is asymptotically stable and its index 
does not exceed the index of (3.3). If C, C and Q in (3.4) are symmetric, i.e., (3.3), 
(3.4) is reciprocal with the external signature Sext as in Proposition 4, then the 
reduced-order model computed by Algorithm 1 is also reciprocal with the same 
signature. Unfortunately, the Lyapunov-based balanced truncation method does 
not, in general, ensure the preservation of passivity. However, for special reci- 
procal circuits such as RC and RL networks. Algorithm 1 can be modified for 
computing a passive reduced-order model, see [65] for details. 
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8 10 




0.03 



Fig. 3.2 a The output functions of the original system (3.3), (3.23) and the reduced-order system 
(3.24); b the output of the reduced-order system (3.25). In both cases, the input is u{t) = sin(f) 



Algorithm 1 Balanced truncation model reduction for DAEs. 
Given G — (E, A, B, C, D), compute a reduced-order model G — {E,A, B, C, D). 



1 . Compute the Cholesky factors Rp and Lp of the proper Gramians Gpc — RpRl and 
Gpo — LplF that satisfy the projected Lyapunov equations (3.19) and (3.20), 



respectively. 

2. Compute the Cholesky factors 7?, and L, of the improper Gramians Gic ~ R/Rj 
and Gio — L/Lj that satisfy the projected Lyapunov equations (3.21) and (3.22), 
respectively. 

3. Compute the singular value decomposition Lj^ERj, = [U\, (/2ldiag(Zi,S2)[Vi, Vi]"'^, 
where the matrices [Ui,U2] and [yi,V'2] have orthonormal columns, Si = 
diag((7i, . . ., (T(^.) and E2 = diag((T^,+i, . . ., ff,.). 

4. Compute the singular value decomposition LJAR, = Ui&V^, where U3 and V3 
have orthonormal columns and = diag(Oi, . . ., Oi^) is nonsingular. 

5. Compute the reduced-order system (£, A, S, C, D) = {W'^ET, W'^AT, W'^B, CT, D) 
with W = [Lp[/lE7'''^L,■[/3^"'''^] and T = [RpVil.'^'^ ,RiVi@-^l\ 



3.3.2 Positive Real Balanced Truncation 



In this section, we describe passivity-preserving model reduction for general RCL 
circuits based on positive real balancing. 

Passivity of the DAE system (3.3) can be characterized via the projected 
positive real Lur'e equations 
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and 



AXE^ + EXA^ = -KcKJ, 


X = p,.xpI > 0, 




(3.26: 


EXC^ - P,B = -KJI, 


M^ + Ml^Jj] 




A^YE + E^YA = -KIKo, 


Y = pJypi > 0, 




(3.2?: 


E^YB - P'^.C'^ = -KIj,„ 


Mo + Ml ^ JlJo 




xeR"'", K,em"-"\ j. 


eM'"'"' and F e M"'", 


Ko 


e M'"'" 



with unknowns 

Jo G K™'", respectively. Such equations are known in the literature also as 
Kalman-Yakubovich-Popov equations [40]. Similarly to [64, Theorem 4.1], one 
can show that if the MNA system (3.3), (3.4) is passive, then (3.26) and (3.27) are 
solvable. Conversely, solvability of the projected Lur'e equations (3.26) and (3.27) 
together with the conditions Mi = M\ > and M*, = for ^ > 1 implies that (3.3) 
is passive. 

Remark 1 Note that for a general DAE system, passivity alone does not guarantee 
the existence of the solution of the projected Lur'e equations. For such a system, in 
addition, R-minimality conditions 

rank[/l£ - A, B] = n, rank[;.£^ - A^, C^] = n for all ;. e C 

(or other weaker conditions) have to be assumed [23, 29, 47, 62]. 

The projected Lur'e equations (3.26) and (3.27) have, in general, many sym- 
metric solutions X and Y that can be ordered with respect to the Loewner ordering 
in the set of symmetric matrices. The minimal solutions Xpr and Ypr that satisfy 
<Xpr <X and < y,,,. < Y for all symmetric solutions X and Y of (3.26) and 
(3.27), respectively, are called the positive real controllability and observability 
Gramians of (3.3). System (3.3) is called positive real balanced if Xp,- — Ypr = 
diag(H,0) with S = diag((^i, . . ., <^„ ). The values £,j ordered decreasingly are 
called the. positive real characteristic values of (3.3). Similarly to Lyapunov-based 
balanced truncation, the reduced-order system (3.16) can be computed by pro- 
jecting onto the subspaces corresponding to the dominant positive real charac- 
teristic values and non-zero improper Hankel singular values. Note that if system 
(3.3) has a proper transfer function, then solving the projected discrete-time 
Lyapunov equations (3.21) and (3.22) can be avoided. The positive real balanced 
truncation method for such a system is summarized in Algorithm 2. It can be 
shown that the resulting reduced-order system is passive, and we have the error 
bound 

||G-G||H,^<2||(Mo + Mo^)-'||||Go||Hj|Go||eJVi + --- + ^'r) (3-28) 

with Go = G + MJ and Go = G + Mj, see [9, 63]. Note that this error bound 
requires the computation of the Hoc -norm of Go, which is expensive for large-scale 
systems. If (.f is chosen in Algorithm 2 such that 
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4||(Mo+M^)-'||||Go||hJ^^+i 
then bound (3.28) can be simplified to 



+ '^^)<1, 



\\G~G\\^^<4\\iMo + M^)-'\\\\Go\\ij^,,+i + --- + Q, (3.29) 

where only the evaluation of the Hoc -norm of the reduced-order system Go is 
required. 

Algorithm 2 Positive real balanced truncation for DAEs with a proper transfer 

function. 

Given passive G = (E, A, B, C, Z)), compute a reduced-order system G = {E,A,B, C,b). 

1. Compute the matrix Mq = C{sqE - AY^QiB + D sq e (0,oo). 

2. Compute the Cholesky factors R and L of the positive real Gramians Xj,,^ — RR^ 
and Ypr — LlJ that are the minimal solutions of the projected positive real 
Lur'e equations (3.26) and (3.27). 

3. Compute the singular value decomposition L^ER = [Ui, f/2ldiag(Hi,H2)[Vi, V2Y , 
where the matrices [U\,U2\ and [ViiVa] have orthonormal columns, Hi = 
diag(<^i, . . ., ^ij) and S2 = diag((^^^+i, . . ., ^„^.). 

4. Compute the reduced-order system 



E = 



I 




A = 



WlATy 
/, 



B = 



WjB 
Bj 



c = [cri,C2], b = D, 



where W\ ~ LU\E^ 
D - Mo = C2B2. 



-1/2 



— 1/2 

Ti — RVii^i , and B2 and C2 are chosen such that 



If Do = Mo + Mq is nonsingular, then the projected Lur'e equations (3.26) and 
(3.27) can be written as the projected positive real Riccati equations 

AXE^ + EXA^ + [EXC'^ - PiB)D^ ' {EXC'^ - PiB) ^ = 0, X = PrXP], (3.30) 
A^YE + E'^YA + {B'^YE - CPrYD^\B'^YE - CPr) =0, F = pJyPi- (3.31) 



The numerical solution of these equations will be discussed in Sect. 3.4.2. The 
major difficulty in solving these equations is that the spectral projectors P, and 
Pi are required. They can be computed by the matrix chain approach from [51]. 
In the large-scale setting, however, it would be beneficial to have an explicit 
representation for P^ and P/ as it has been done in [75] for some other 
structured DAEs arising in computational fluid dynamics and multibody sys- 
tems. Such a representation in terms of the incidence matrices is currently 
under investigation. 
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3.3.3 Passivity-Preserving Model Reduction Via Bounded Real 
Balanced Truncation 

Another passivity-preserving model reduction approach presented first in [63] is 
based on the bounded real balanced truncation model reduction method applied to 
the Moebius-transformed system G = A^(G) = {E,A,B,C,b) as in (3.15). For 
the MNA equations (3.3), (3.4), where Q is positive definite and both C and C are 
symmetric and positive definite, it has been shown in [64] that the projected 
bounded real Lur'e equations 



and 



AXE^ + EXJi + PiWp] = -K,kJ, X = P,XpI > 0, 

Exc^ - PiBmI = -/^,.yj, / - MqMI = 7,/^ 



A^YE + fYA + Plc^CPr ^ -KIKo, Y = P^YP, > 0, 
-E^YB + Plc^Mo = ~KlJ„ , I - Mo Mo - J^Jo 



(3.32) 



(3.33) 



are solvable for X e R"'",Kc G M"'™, Jc € M™'™ and Y e M"'", Ko e M'"'", 
Jo G M™'™, respectively. Here, P,. and Pi are the spectral projectors onto the right 
and left deflating subspaces of ?.E — A corresponding to the finite eigenvalues 
along the right and left deflating subspaces corresponding to the eigenvalue at 
infinity, and Mq — limj^oc C{sE — A)" B + D. The minimal solutions X/,,- and F^^ 
satisfying < Xhr < X and 0<Y[,r<Y for all symmetric solutions X and Y of 
(3.32) and (3.33), respectively, are called the bounded real controllability and 
observability Gramians of system G. This system is bounded real balanced 
if Xhi- — Y),,. — d\2Lg{T ,Q) with F = diag(7i, . . .,7^). The values jj ordered 

decreasingly are called the bounded real characteristic values of G. Truncating the 
states of G corresponding to small y, and applying the Moebius transformation to 
the obtained contractive reduced-order model, we get a passive reduced-order 
system G. The resulting passivity-preserving model reduction method for circuit 
equations is presented in Algorithm 3. For this method, we have the following a 
priori error bound 

11^ ^11 . ll'' + G||H^(y/,+i + --- + 7„,) 



^-P + G||h^(7^,+i + ••• + )',,.)' 

provided ||/ + G||jj (iV+i + ■ ■ ■ + 7^) < 1, see [63]. Furthermore, if we choose If 
in Algorithm 3 such that 2||/ + G||jj (7^^+! + ••• + )'»)< 1, then we obtain the a 
posteriori error bound 
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|G-G||H^<2||/ + G||^Jy^+i + ... + 7^) 
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(3.34) 



that is inexpensive to compute. 

Algorithm 3 Passivity-preserving model reduction based on bounded real bal- 
anced truncation. 

Given passive G — (E, A, B, C, 0), compute a reduced-order model 
G= (£,A,B,C,0). 

1. Compute the Moebius-transformed system G = {E,A,B,C,D) as in (3.15). 

2. Compute the matrix Mq = D + C{sqE — A)" QjB for some sq e (0,oo). 

3. Compute the Cholesky factors R and L of the bounded real Gramians 

Xhr = RR and Yj,,- = LL that are the minimal solutions of the projected 
bounded real Lur'e equations (3.32) and (3.33). 

4. Compute the singular value decomposition L ER = [C/i , t/2]diag(ri , Fa) [Vi , V2] , 
where the matrices [1/1,1/2] and [VijVi] have orthonormal columns, Fi = 
diag(yi, . . ., 7^) and F2 = diag()v^.+i, ...,y^). 

5. Compute the reduced-order system 



E = 



B = 



I 0' 
.0 


1 


~ 1 


2w\Afi 
~V2B2Cti 


; V2w]bC2 
; 2/ - B2C2 _ 


-B2I 


B 

V2. 


, c' 


== 


_cl/V2_ 


1 



-1/2 



where Wi =Lt/iFi''% Ti =/?yiF, 
/ - Mo = C2B2. 



-1/2 



and B2 and C2 are chosen such that 



It should be noted that for DAE systems with a proper transfer function. 
Algorithms 2 and 3 are equivalent in the sense that they provide reduced-order 
models with the same transfer function. 

Using the topological structure of circuit equations, the matrix Mq and the 
projector Pr can be computed in explicit form 



Mo = 



I~2AjQcHa'Q'cAi 

1/ 



2AjQcH^'QlAv 



2AlQcH^ ' QlA, -I + lAlQcH^, ' Q^Ay 



Hi{H^H2-I) H5H4ALH6 
H(, 

-AI{HaH2-I) -AIH^lH6 



(3.35) 



(3.36) 
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where 

^0 = QUArGAI+AjAJ +AvAl)Qc + Qljy_cQRiv-c, 

H\ — PcRiv^cRiv + Qcriv^lSA^Qcriv , 

Hi = ArGAI + A,A] + AvAl + AlSAIQcrivH^'QIkiv^lSAI, 

H, = AcCAl + QlHiQc, H^ = QcH^'QI, 

H5 = QcRivH^ Qf^j^jyAiSA^ —I, H(, = I ~ SA^QcRivH^ Qcriv'^l, 

QcRiv is a projector onto ker( [Ac, Ar, A/, Ay]^), Pcriv = 1 ~ Qcriv, 

Qriv-c is a projector onto ker([AR, A/, Av]^Qc), 

see [64] for details. Furthermore, the left projector is given by Pi ~ SintP,. ,Sint, where 
^int = diag(/„, , — /„i, —lyiv) and P,-,, is as P,- with Q, S and C replaced by Q^ , S^ and 
C^ , respectively. The projectors Qc, Qcriv and Qriv-c can easily be computed in 
sparse form using graph search algorithms like breadth- first-search [44]. Although 
Mo and Pr look very complex, their computation is inexpensive if the sparsity of the 
incidence matrices and Q-projectors is exploited. Due to the space limitation, we 
omit detailed discussions. 

If the circuit contains neither CVI-loops except for C-loops nor LVI-cutsets 
except for L-cutsets, i.e., if rank(Q^[A/,Av]) = «/ + «v and Q^f-.[Ai,Ay\=0, 

where Qrc is a projector onto ker([AR,Ac] ), then both D^ — I — MqMq and 

^ J- * 
Do = I — MqMq are nonsingular [64], and the projected Lur'e equations (3.32) 

and (3.33) can be written as the projected bounded real Riccati equations 
AXE^ + EXA^ + IPiBB^p] + 2{EXC^ - PiBmI)D^\eXC'^ - PiBulf = 0, 



T 



X = PrXP^ 



and 



(3.37) 



aJyE + E^YA + IpIc^CP,- + 2{B'^YE - MlcPrfb^\B'^YE - mIcP,) = 0, 
Y = p]yPi. 

(3.38) 

The numerical solution of these equations will be considered in Sect. 3.4.2. 



3.3.4 Balanced Truncation for Reciprocal Circuit Equations 

For reciprocal circuits with symmetric C, C and Q, we can further exploit the 
structure of the system matrices in (3.4) in order to reduce the computational 
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complexity of Algorithms 2 and 3. In the sequel, we consider Algorithm 3 only. 
The other one can be modified analogously. 

Consider the reciprocal system (3.3), (3.4), where C, £ and G are symmetric. 
Then 

^ = 'Jint^'Jinti ^ = 'Jint^'Jinti " = 'Jextl"->int 

with ^int = diag (/„,,, -/„^, -/„ J and ^ext = diag(/„,, -/„,,). Therefore, we have 



"/ — "Jint "r "Ji 



int^ r ^int: 



Yhr — >Jint^i;-»->int — »->int"" -Jint — ^^ ■ 



(3.39) 



Since L ER = R SintER and (/ — Mo)5ext are both symmetric, the characteristic 
values )', and the matrices B2 and C2 can be determined from the eigenvalue 

decompositions of 7? Si^ER and (/ — ^^0)56x1 instead of a more expensive singular 
value decomposition. We summarize the resulting passivity-preserving balanced 
truncation method for reciprocal electrical circuits (PABTEC) in Algorithm 4. 
Note that this method also preserves reciprocity in the reduced-order model, see 
[64]. 

Algorithm 4 Passivity-preserving balanced truncation for electrical circuits 
(PABTEC). 

Given passive G = {E, A, B, C, 0) with the system matrices as in (3.4), com- 
pute a reduced-order model G — {E,A,B, C, 0). 



1. Compute the Cholesky factor R of the bounded real Gramian Xh,- = RR that is 
the minimal solution of the projected Lur'e equation (3.32), where E, A, B and 
C are as in (3.15), the projectors P,. and P/ are given in (3.36) and (3.39), 
respectively, and Mq is as in (3.35). 

2. Compute the eigenvalue decomposition R 5i„,M = [C/i, [/2]diag(Ai, A2)[C/i, C/2]^, 
where [f/i, U2] is orthogonal, A 1 — diag(/li, . . ., i^) and A2 =diag(/l^+i, . . ., 1^). 

3. Compute the eigenvalue decomposition (/ — Mo)5'ext = UqAqUq, where Uq has 
orthonormal columns and Aq = diag(li, . . .,/!,,„) is nonsingular. 

4. Compute the reduced-order system 



B = 



I 0' 
.0 0. 


■) 


^-l 


IW^ATi 
_~V2B2Cfi 


V2w]bC2 
21 ~ B2C2 . 


.-52/ 


B 

V2_ 


, c' 


= 


'{cf.f 


: 



where 
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B2^So\Ao\'^^UlS,,„ C2 = t/o|Ao 



II 

1/2 



with 



|Ai| ==diag(|li|,...,|i/||), Si ==diag(sign(li),...,sign(A^)), 
|Ao| = diag(|li|,...,|l„|), ^0 = diag(sign(ii), . . ., sign(lm)). 



3.4 Numerical Methods for Matrix Equations 

In this section, we consider numerical algorithms for the projected Lyapunov 
equations (3.19) and (3.20) and the projected Riccati equations (3.30), (3.31) and 
(3.37), (3.38) developed in [10, 75]. In practice, the numerical rank of the solutions 
of these equations is often much smaller than the dimension of the problem. Then 
such solutions can be well approximated by low-rank matrices. Moreover, these 
low-rank approximations can be determined directly in factored form. Replacing 
the Cholesky factors of the Gramians in Algorithms 1-4 by their low-rank factors 
reduces significantly the computational complexity and storage requirements in the 
balancing-related model reduction methods and makes these methods very suitable 
for large-scale DAE systems. 



3.4.1 ADI Method for Projected Lyapunov Equations 

First, we focus on solving the projected Lyapunov equation 

EXA^ + AXE^ = -PiBB^P], X = P,.XP,^, (3.40) 

using the alternating direction implicit (ADI) method. The dual equation can be 
treated analogously. The ADI method has been first proposed for standard 
Lyapunov equations [16, 49, 57, 80] and then extended in [75] to projected 
Lyapunov equations. The generalized ADI iteration for the projected Lyapunov 
equation (3.40) is given by 



{E + x,A)Xk-ii2A^ + AXu-i{E - x,aY = -P,BB^Pj, 
(E + T^A)X[A^ + AX[_i/2(£ - r^Af = -P,BB^Pj 



(3.41) 



with an initial matrix Xq = and shift parameters ti, . . ., tj e C_. If the pencil 
aE — A is c-stable, then Z<. converges towards the solution of the projected 
Lyapunov equation (3.40). The rate of convergence depends strongly on the choice 
of the shift parameters. The optimal shift parameters providing the superlinear 
convergence satisfy the generalized ADI minimax problem 
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(1 -Tit) ■■■{I - V)l 



{ti,...,t^}= argmin 



max 



{T,,...,T,}eC_ 'eSpjiEA) 1(1 + Tif) • • • (1 + Zgt)\ 

where Spf{E, A) denotes the finite spectrum of the pencil aE — A. Similarly to 
[57], the suboptimal ADI parameters can be obtained from a set of largest and 
smallest in modulus approximate finite eigenvalues of XE — A computed by an 
Arnoldi procedure. Other parameter selection techniques developed for standard 
Lyapunov equations [17, 69, 81] can also be used for the projected Lyapunov 
equation (3.40). 

A low-rank approximation to the solution of the projected Lyapunov equation 
(3.40) can be computed in factored form X « Z^Zj using a low-rank version of the 
ADI method (LR-ADl) as presented in Algorithm 5. 

Algorithm 5 The generalized LR-ADI for the projected Lyapunov equation. 
Given E,A e M."'", B e K"-'", projector P/ and shift parameters Ti, . . ., t, e C_, 
compute a low-rank approximation X w Z^Zl to the solution of the projected 
Lyapunov equation (3.40). 

1. ZO = ^-2Re(Ti)(£ + TiA)"'P,B, Zi =Z(i'; 

2. FORyt = 2,3,... 



ZW - W^g^(/- (T,_i +t,)(£ + t,A)-'a)z(*-", Z, = [Z,_,,Z«] 

END FOR 

In order to guarantee for the factors Z^ to be real in case of complex shift 
parameters, we take these parameters in complex conjugate pairs {t^^, ti-^-i = t^}. 
At each iteration we have Zj = [Z"', . . .,Z(*'] e M"""^. To keep the low-rank 
structure in Z^ for large mk, we can compress the columns of Z^. using the rank- 
revealing QR factorization [21] as described in [8]. 

Finally, note that the matrices (E + t^. A)~^ in Algorithm 5 do not have to be 
computed explicitly. Instead, we solve linear systems of the form {E + TkA)x = Pib 
either by computing (sparse) LU factorizations and forward/backward substitutions 
or by using iterative Krylov subspace methods [68]. 



3.4.2 Newton-Kleinman Method for Projected Riccati Equations 

In this section, we consider the numerical solution of the projected Riccati 
equation 
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Mix) = EXF'^ + FXE^ + EXH^HXE^ + niQQ^YlJ = 0, X = U^xn^, 

(3.42) 



where 



F =A-P,B{Mo + Ml) ^CP,... H = j;^CPr, Q = Bjf, 
Mo + M^ = J,jJ, n 

in the positive real case and 



Mo + M^ = J,jJ, n, = Pi, n, = Pr 



(3.43) 



F = A-BC- 2PiBmI{I - MomI) ' CP,-, 

H = V2J- ' CP, , 2 = - ^25/; ' , (3 .44) 

jIJo = i-mImq, j,jJ^i-MomI, n, = p,, n, = p. 



in the bounded real case. The minimal solution X^^^ of (3. 42) is at least semi- 
stabilizing in the sense that all the finite eigenvalues of IE — F — EX^^Ji^H are 
in the closed left half-plane. Such a solution can be computed via the Newton- 
Kleinman method [10] as presented in Algorithm 6. 

Algorithm 6 The generalized Newton-Kleinman method for the projected 
Riccati equation. 

Given E, P e M"'" , // G M"''" , Q e M"'"', projectors n„ H/ and a stabilizing 
initial guess Zq, compute an approximate solution of the projected Riccati equation 
(3.42). 
FOR; =1,2,..., 

1. Compute Kj = EXj^iH^ and Fj = F + KjH. 

2. Solve the projected Lyapunov equation 

EXjFj + FjXjE^ = -n,(ee^ - KjKf)nJ, Xj = n,XjnJ.. 

END FOR 

Similarly to the standard state space case [6, 78], one can show that if aE — F is 
c-stable, then for Xq — 0, all ?.E — Fj are also c-stable and lim^^oc ^j — ^min- The 
convergence is quadratic if the pencil IE — F — EX^^Ji^H is c-stable. Some 
difficulties may occur if the pencil aE — F has eigenvalues on the imaginary axis. 
For circuit equations, these eigenvalues are uncontrollable and unobservable [64]. 
In that case, similarly to [12], one could choose a special stabilizing initial guess 
Xq that ensures the convergence of the Newton-Kleinman iteration. However, the 
computation of such a guess for large-scale problems remains an open problem. 

A low-rank approximation to the minimal solution of the projected Riccati 

equation (3.42) can be computed in factored form X^in w RR with R e R"'*, k <^ 
n using the same approach as in [11]. Starting with Ki = EX^H^ and F\ = 
F + K\H, in each Newton iteration we solve two projected Lyapunov equations 
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EXijFj + FjXijE^ = -UiQQ^nJ, Xy,j = P.XijP'^, (3.45) 

EX2jFj + FjX2.jE^ = -n,KjKjnJ, X2J = n,X2jnl, (3.46) 

for the low -rank approximations Xij w RijRf : and X2J « R2,jR{ :, respectively, 
and then compute /Ty+i = E^RijRjj - RijRlJjH'^ and Fj+i ^ F + Kj+iH. If the 
convergence is observed after yVnax iterations, then an approximate solution 

^min ~ RR of the projected Riccati equation (3.42) can be computed in factored 
form by solving the projected Lyapunov equation 

EXF^ + FXE^ = -niQoQllli, X = n,xn^ (3.47) 

with Qq = \Q, £(Zi,j^^ — X2j_^^ J//^] provided IE — F is c-stable. For computing 
low-rank factors of the solutions of the projected Lyapunov equations (3.45)-(3.47), 
we can use the generalized LR-ADI method. Note that in this method we need to 
compute the products {E + t Fjf w with t e C_ and w G M". For example, in the 
bounded real case we have E + xFj = £ + t(A — BC) — iKjH with the low-rank 
matrices H e R"'" and Kj = y/lPiBM^J;'^ - Kj e M"'"'. Then one can use the 
Sherman-Morrison-Woodbury formula [36, Section 2.1.3] to compute these prod- 
ucts as 

{E + xFjY'w = wi + M^^ ((/„, - HMj,)-'Hw, ) , 

where Wj = (£ + t(A - BC))"V and M^, = t{E + z{A - BC)y'^kj can be 
determined by solving linear systems with the sparse matrix E + t(A — BC) either 
by computing sparse LU factorization or by using Krylov subspace methods [68] . 



3.5 Numerical Examples 

In this section, we present some results of numerical experiments to demonstrate 
the efficiency of the passivity-preserving balancing-related model reduction 
methods for circuit equations described in Sect. 3.3. 

Example 3 The first example is a three-port RC circuit provided by NEC 
Laboratories Europe. The passive reciprocal system of order n — 2007 was 
approximated by two models of order ^ = 42 computed by the positive real 
balanced truncation (PRBT) method and the bounded real balanced truncation 
(BRBT-M) method applied to the Moebius-transformed system. The minimal 
solutions of the projected Riccati equations (3.30) and (3.37) were approxi- 
mated by the low rank matrices Xp,. « Rpi-Rp,- with Rp,. e R"'^"' and X;„. « Rbi-Rbr 
with Rhi & R"'^^, respectively. Figure 3.3a shows the normalized residuals 
p{Xj) = \\-'MiXj)\\p/ WYliQQ'^nJWp, where || • ||^ denotes the Frobenius matrix 
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Fig. 3.3 RC circuit: (a) the convergence history of the low-rank Newton-Kleinman-ADI 
method; (b) the number of ADI iterations required for solving the projected Lyapunov equations 
at each Newton iteration 



norm, X, = Ri.jR{j ~ Ri.jRi.j, n, and Q are given in (3.43) and (3.44) for (3.30) 
and (3.37), respectively. Figure 3.3b displays the number of ADI iterations 
required for solving the projected Lyapunov equations at each Newton iteration. 

The spectral norms of the frequency responses ||G(i(u)|| and ||G(ia;)|| for a 
frequency range co e [1, 10^^] are presented in Fig. 3.4a. In Fig. 3.4b, we display 
the absolute errors ||G(!ffl) — G(jco)|| for both reduced-order systems and also the 
error bounds (3.29) and (3.34). As expected, due to the properness of G, the PRBT 
and BRBT-M methods are equivalent and provide similar results. 

Example 4 The second example is a transmission line model [5] that consists of 

20,000 RLC ladders. We approximate the DAE system of order n — 60,000 by a 

model of order £ = 32 computed by the PABTEC method (Algorithm 4). The 

— J 
bounded real Gramian X^^ was approximated by a low-rank matrix Xbr ~ RR with 

R e M"'^"*^. Figure 3.5a presents the bounded real characteristic values of the 
Moebius-transformed system computed as the absolute values of the eigenvalues 

of R SimER. One can see that the characteristic values decay rapidly, so we can 
expect a good approximation by a reduced-order model. The frequency responses 
of the full-order and the reduced-order models are not presented, since they were 
impossible to distinguish. Figure 3.5b shows the absolute error ||G(iffl) — G(/co)|| 
for CO e [1, 10^°] and the error bound (3.34). 



3.6 Conclusions and Open Problems 



In this paper, we have discussed balancing-related model reduction methods for 
linear DAEs arising in circuit simulation. The important property of these methods 
is the existence of computable error bounds that essentially distinguishes the 
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Fig. 3.4 RC circuit: (a) the frequency responses of the original and the reduced-order systems; 
(b) the absolute errors and error bounds 
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Fig. 3.5 Transmission line: (a) the bounded real characteristic values; (b) the absolute error and 
the error bound (3.34) 



balanced truncation technique from other model reduction approaches. Moreover, 
positive real balanced truncation and bounded real balanced truncation applied to a 
Moebius-transformed system guarantee the preservation of passivity in a reduced- 
order model that makes these methods very suitable for circuit equations. 

Balancing-related model reduction methods for DAEs involve solving projected 
Lyapunov, Lur'e and Riccati matrix equations. We have presented the efficient 
numerical algorithms for large-scale projected Lyapunov and Riccati equations 
based on the LR-ADl iteration and the Newton-Kleinman method, respectively. 
We have also shown that exploiting the topological structure of circuit equations 
reduces substantially the numerical complexity of balanced truncation. 

Although considerable progress has recently been achieved in developing of 
balanced truncation methods for large-scale DAEs, some problems still remain 

■J. ^ ^ T 

open. For example, if Mq + Mq (or / — MqMq ) is singular, then to compute the 
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positive real (bounded real) Gramians we have to solve the projected Lur'e 
equations. Similarly to the standard state space case [82], for small to medium- 
sized DAE systems, these equations can be transformed to projected Riccati 
equations of smaller dimension. This approach becomes, however, prohibitive for 
large-scale problems due to the explicit use of state space transformations. Another 
problem, already mentioned in Sect. 3.4.2, is the computation of an appropriate 
stabilizing initial matrix in the Newton-Kleinman iteration in case when the pencil 
has pure imaginary eigenvalues. This problem could probably be solved for circuit 
equations by exploiting their special structure. 

Finally, in some numerical experiments we observed a very slow convergence 
of the LR-ADl iteration caused by a poor choice of the shift parameters. The 
combination of the LR-ADl iteration with the Galerkin projection as proposed in 
[18, 43] for standard state space systems may improve the performance of the ADI 
method. Also, a generalization of an extended Krylov subspace method [71] to the 
projected Lyapunov equations remains for future work. 
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Chapter 4 

Topics in Model Order Reduction 

with Applications to Circuit Simulation 

Sanda Lefteriu and Athanasios C. Antoulas 



Abstract When integrating several components on a chip, one may be faced with 
the task of combining parts which were not manufactured in house. Oftentimes, no 
accurate model is available for these constituents. In this scenario, one may perform 
measurements of each device in the frequency range of interest. This information 
can then be employed to construct a reduced order model which can be used in a 
simulator which integrates the behaviour of each component and analyzes the entire 
chip. This paper addresses the problem of modeling a system based on measure- 
ments of its frequency response using the concept of tangential interpolation, 
together with the tool of the Loewner matrix pencil. This concept is particularly 
useful in the context of modeling components with a large number of input and 
output ports. After a thorough theoretical analysis which motivates the approach 
and makes connections to system theory, we present two numerical results which 
show its applicability to different systems arising in circuit simulation. 

Keywords Tangential interpolation • Lower marroit pencil • Multiport systems • 
System with delay 

4.1 Introduction and Motivation 

The current trend in VLSI circuits is decreasing the feature size while increasing 
the chip size, resulting in an ever-growing chip complexity. At the same time, the 



S. Lefteriu (E3) • Athanasios C.Antoulas 

Department of Electrical and Computer Engineering, Rice University, 

Houston, TX, USA 

e-mail: sanda.lefteriu@rice.edu 

Athanasios C.Antoulas 
e-mail: aca@rice.edu 



P. Benner et al. (eds.). Model Reduction for Circuit Simulation, 85 

Lecture Notes in Electrical Engineering, 74, DOL 10.1007/978-94-007-0089-5_4, 
© Springer Science-l-Business Media B.V. 2011 



86 Sanda Lefteriu and A. C. Antoulas 

operating frequency is also increasing. Hence, an important step in the design 
process is to be able to simulate the desired chip layout in a reasonable amount of 
time. Model order reduction proves to be a very useful tool in this case, since it is 
capable of decreasing the simulation time of complex systems to a few seconds by 
constructing models of smaller dimensions which preserve the essential properties 
and characteristics of the original one. 

Traditionally, model reduction has been applied to systems for which an accurate, 
but very complex model, is available, for example by discretizing Maxwell's 
equations. An alternative technique [2] is model reduction from time domain or 
frequency domain measurements of devices which have already been manufactured, 
for which no model is available. Constructing models from time or frequency 
domain data are essentially different research problems as each presents different 
challenges, but in this paper we focus on modeling from measurements of the 
frequency response. We do not investigate what is a good choice of sampling points, 
as this is a complex research question by itself. Assuming that the measurements 
capture the dynamic behaviour of interest, the next step is to construct a macromodel 
of low complexity which is consistent with the data. Reduced order modeling from 
frequency-domain measurements is related to the rational interpolation problem [2]. 

Several algorithms [6, 15, 24] are already available in the electronics com- 
munity. The goal of most techniques is fitting the data, essentially interpreting it is 
an optimization problem. Thus, most procedures are based on least-squares 
approximations. Vector fitting (VF) [7, 13-17] is a robust technique which solves 
the problem in two stages: one for computing the poles of the transfer function and 
the other for finding the residues. While the first stage is an iterative process in 
which a set of better poles is determined by solving a least-squares problem at each 
step, finding the residues only needs one least-squares approximation. This algo- 
rithm has been extended to handle time domain data [ 1 2] and several variants were 
developed for parametric macromodeling [8, 26] which incorporate the depen- 
dence on parameters such as geometry or material properties. 

The algorithm introduced in [6] uses Nevanlinna-Pick interpolation in the 
context of bounded-real interpolation of S-parameters. As shown in [3], passivity 
is enforced by interpolating at the original points (i',, H(i,)), together with the 

associated mirror-image points ( — .j^, jjp-ry)- Hence this method can not recover 

the original system, unless the latter satisfies the mirror-image interpolation con- 
straints, which is generally not the case. Moreover, this technique uses matrix 
interpolation, by using the whole matrix transfer function for each sample. 
Algorithms which enforce passivity by construction are described in [11, 22]. 

Our approach has its foundations in system theory and employs the concept of 
tangential interpolation in the Loewner matrix pencil framework. The Loewner 
and shifted Loewner matrices can be expressed in terms of the generalized con- 
trollability and observability matrices, tools which were first used in the control 
community, but have found applications in many other areas, including circuit 
simulation. Tangential interpolation is of great importance in our context due to 
the fact that it allows to build reduced models for devices with a large number of 
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input and output ports using a small computational time [20]. The issue of massive 
ports is problematic for currently available algorithms which are employed in 
macromodeling, as they are extremely CPU-demanding. Moreover, we show that 
if the number of available measurements is right, one can fully recover the 
underlying system. If more measurements are available than needed, plotting the 
singular values of the Loewner matrix pencil allows to identify the original system 
via an SVD truncation. Thus, in our approach, the data is revealing the order of the 
system, unlike vector fitting, which depends on a few parameters (the number of 
starting poles, the location of the starting poles, or the number of iterations to 
relocate the poles) to produce a good macromodel. 

The numerical examples we investigate are systems arising in circuit simula- 
tion. First, we analyze a single input single output system, which, due to the delay, 
is infinite-dimensional. Our goal is to built a finite-dimensional model which is 
able to capture the low-frequency response. Second, we compare tangential 
interpolation to matrix interpolation on an RC circuit with a large number of ports. 

This paper is organized as follows. Section 4.2 reviews some system-theoretic 
concepts related to model reduction from measurements. Next, we introduce the 
rational interpolation problem in the case of scalar data and matrix tangential data 
and propose a solution in the Loewner framework in Sect. 4.3. Moreover, this 
section provides guidelines on how to choose the sampling directions to ensure full 
recovery of the system. Section 4.4 briefly shows how to construct tangential data 
from given matrix data. Section 4.5 presents numerical examples which validate 
the proposed approach on systems arising in circuit simulation and we conclude in 
Sect. 4.6. 



4.2 Background 

An LTI system Z with m inputs, p outputs and n internal variables in descriptor- 
form representation is given by a system of differential and algebraic equations: 

Z : Ex(f) = Ax(f) + Bu(f), y(f) = Cx(f) + Du(r), (4.1) 

where x(f) is an internal variable (the state, if E is invertible), u(?) is an input, y(f) 
is the corresponding output, while E, A e M"""', B e M'""", C G W", D e W^'" 
are constant matrices with E possibly singular. The transfer function of Y. is 
H(i') = C(.?E — A)" B + D. It is proper if limj^oo H(s) <cx) and improper, 
otherwise. Moreover, if lim^^oc H(i') = 0, H(5) is strictly proper. The matrix 
pencil (A, E) is regular if the matrix A — IE is nonsingular for some finite X G C. 
The set [E, A, B, C, D] is called a realization of H(i). The realization is not unique; 
the one of the smallest possible order n is called a minimal realization and the 
order n is referred to as the McMillan degree. A realization is minimal if it is 
completely controllable and observable. A controllable system implies that any 
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realization is controllable, and the same holds for observability. A descriptor 
system with (A, E) regular is completely controllable [27] if rank[A — IE, B] — n, 

for all finite leC and rank[E,B] = n <^ the matrix .^ := (IjE - A)"' 
has rank n. It is called completely observable if 



B...(i„E. 
rank [A^ - 



-A)-'B] 
1E^,C^] = n, for all finite 1 e C and rank[E^,C^] = n, where (^ 



denotes transpose <^ the matrix (9 



C(^iE-A)- 



has rank n. The poles are 



.c(a<„e-a)- 

given by the eigenvalues of the matrix pencil (A,E). S is stable if all its finite 
poles are in the left-half plane: S stable ^ Re{l{A,E)) <0 for |1(A,E)| <cx). 

Model reduction from frequency-domain data can be formulated as a rational 
interpolation problem irrespective of the quantities which are measured. Assuming 
that our circuit is voltage-excited and the corresponding outputs are the port 
currents, we can measure the admittance parameters Y. A system E models a data 
set with P measurements of the Y-parameters of a device with p outputs and 
m inputs 



/ 



yi-,YW := 



V 



/(■■) 



y(i) 
>1 



Im 



y{i) 
l^ pm 



\ 



/=i,...,p, 



where fi is the frequency at which the parameters Y''' are measured, if the asso- 
ciated transfer function evaluated at j ■ Infi is close to Y^'' : 



Ii(J-2nfi)^Y^'>, /=!,..., P. 



(4.2) 



4.3 Theoretical Aspects 



We start by analyzing the rational interpolation problem in the simple case of 
model reduction from scalar data (i.e., m — p — 1): 

{si,Zi), i= 1,...,P, Si :^Sj, i^j, 

where .s,-,z, G C. The sampling points Sj are not necessarily on the imaginary axis, 
as it was the case in (4.2). The rational interpolation problem is equivalent to 

finding H{s) — '-jA, with n, d coprime polynomials and deg(n{s)) < deg(d(s)), such 

that H(i,) — Zi, i — 1,. . .,P. There always exists a solution in polynomial form, 
e.g., the Lagrange interpolating polynomial. Our approach is based on the Loewner 
matrix [4] which is constructed by partitioning the data in disjoint sets: 
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{li,Wi), i=l,...,k and {tij,Vj), j=l,...,h, 
where h,k fa [Q such that k + h — P, using the formula: 



89 



L = 



Vl—Wl 
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-Wl-l 
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-h 
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-4 J 



•^hy.k 



(4.3) 



There are several reasons why this is the right tool to use. The rank of the Loewner 
matrices constructed using all possible partitions encodes the degree of the min- 
imal interpolant of the data. This is due to the system theoretic interpretation [5] of 
the Loewner matrix as minus the product of the generalized observability matrix, 
G, the underlying E matrix and the generalized controllability matrix, 3i. In par- 
ticular, suppose that the data is obtained by sampling a proper rational function 
H(i') with minimal representation [E, A,B, C,0] (i.e., H(i) = C(.sE 
Then 



A)-'B). 



L 



C(/^iE-A)- 



C(/i,E-A)- 



E- [(;.iE-a)"'b ... (;.*e-a)"'b] 



St 



Using the fact that the system is completely controllable and observable, the 
rank of L is precisely the rank of E (assuming that h,k> rank(£)), which is the 
order of the minimal degree interpolant of the data. Last, in the special case of data 
consisting of a single point with multiplicity, namely: {sq\ (/jq, (pi, . . ., (t>p^i), i.e. 
the value of a function at sq and that of a number of its derivatives are provided, it 
can be shown that the Loewner matrix has Hankel structure. Thus the Loewner 
matrix generalizes the Hankel matrix when interpolation at finite points is con- 
sidered [1, 4, 5]. 

To help the reader understand this concept, we consider the following simple 
example. Suppose the underlying transfer function which generates the data is 
H(i') = ^ and we are given P — A samples of this function: 



(^,^)= 1 



2,; 



-1,- 



(4.4) 



We partition this set as follows: 

k — 2, {a,w) = 



h'^Jl,^]}, h^2,{fi,v) 



'4 



(4.5) 
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and build the Loewner matrix according to (4.3) 

L = i^ "■? e 



1 

24 

1 


1 
28 

1 


T8 


I 

21 



i2x2 



(4.6) 



It follows that the rank of L is 1, so the interpolant for these 4 points has 

1 

.s+5 



minimal degree 1, which is indeed the case, since H{s) — -^ is of degree 1 



4.3.1 Tangential Interpolation 

Tangential interpolation is a general form of the interpolation problem, where 
matrix data are sampled directionally on the left and on the right. Tangential data 
consist of the right interpolation data 



{(i,-,r,-,w,-)|l,-eC,r,-eC'«^',w,.eC''^'}, 

for i = I, . . .,k, or, more compactly, 

A = diag[;.i,..., 4]gC*^^ 

R=[ri,..., n-leC"^*, 

W=[wi,..., w,]eC''^^ 
and of the left interpolation data 

{{^ij,£j,yj)\fijeC,ijeC'^'\yjeC^^"'}, 
for j = I, . . .,h, or, more compactly, 

M = diag[^i,...,/i;,] e ( 

Vl 



~^hxh 



eC''"^V = 



v/, 



eC 



hxm 



(4.7) 

(4.8) 

(4.9) 

(4.10) 

(4.11) 

(4.12) 
(4.13) 



In this case, the rational interpolation problem consists of finding a realization 
in descriptor form [E,A, B, C,D], such that the associated transfer function 
H(i') = C{sE — A)" B + D, satisfies the right and left constraints 



H(;,-)r,- = w,-, ij^i^ij) = yj. 



(4.14) 



The tools for addressing this problem are the Loewner matrix, variously called 
the divided-difference matrix and null-pole coupling matrix together with the 
shifted Loewner matrix associated with the data. For the material that follows, we 
refer to [21] for details on proofs and derivations. 
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4.3.2 The Loewner and the Shifted Loewner Matrices 
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Given S = {^i , . . ., sp} points in the complex plane and the evaluation of a rational 
matrix function H(i') at those points: {H(5i), . . .,H(i'/))}, we can partition S as: 

s = {ii,...,;.ju{^i,...,^j, 

where k + h — P. We construct the right and left data by appropriately selecting 
the sampling directions r, and £, (see Sect. 4.3.4 for details). 

The Loewner matrix and the shifted Loewner matrix are both of dimension 
h X k, and are defined in terms of the data (4.7) and (4.11) as follows 



(4.15) 
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(4.16) 



Each entry in (4.15) and (4.16), for instance ' '_■' ' , is a scalar quantity, and is 

obtained by taking inner products of the left data (row vectors) and right data 
(column vectors). Assuming that the rational matrix function H{s) generates the 
data, the shifted Loewner matrix is the Loewner matrix corresponding to i'H(i'). It 
is straightforward to see from (4.15) and (4.16) that these matrices satisfy the 
Sylvester equations 



.A -MI 



LW - VR, and (tLA - McrL = LWA - MVR. 



(4.17) 



These expressions allow for an efficient MATLAB " implementation to ini- 
tialize the matrices, by avoiding nested "for" loops to initialize entries one by one. 



4.3.3 The Solution to the General Tangential Interpolation 
Problem in the Loewner Framework 



We review the conditions for the solution of the general tangential interpolation 
problem by means of state-space matrices [E,A,B,C,D], as presented in [21]. 

Lemma 1 Assume that k — h and that det(xL — crL) ^ O,for all x £ {?.,} U {/i,} 
(i.e., the matrix pencil (crL, L) is regular and /i-, 1; ^ /l(ffL, L)j. Then E — — L, 
A = —crL, B = V, C = W and D = w a minimal realization of an interpolant of 
the data. Thus, the associated transfer function 
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H(.?) = W((tL-5L)"'V 



(4.18) 



satisfies the left interpolation conditions ijH^fii) — V/ and the right interpolation 
conditions H(l,)r,- = w,-. 

We revisit the previous example from the point of view of tangential interpo- 
lation. We consider the same transfer function, namely \i-{s) ~ ■^, as well as right 
tangential data (/t,r, w) = (l, l,g) (i.e., k — \) and left tangential data {fi,i,y) = 
(-1, l,i) (i.e., h = 1). Compactly, we write this as A = [ 1 ], R = [ 1 ], W = [i] 
and M = [— 1],L= [1],V= [|]. The Loewner matrix pencil is built according 
to (4.15) and (4.16): L= [-^] eC"",(TL= [|f] e C""'. Lemma I's assump- 
tions are satisfied (k — h — 1, det(xL — crL) = det(x(— j^) — ^) 7^ 0, for 
X € {1, — 1} and fij = —1,/!/ — I ^ ^{ff = ^,L = —55))^ so we directly write 
the realization 



E,: 



1 

24' 



A,- 



5 
24' 



B. 



1 

4' 



C,= 



D = 



One can check that the transfer function associated to this realization H{s) = 

S (■^ 2? "'" ^) 3 ^^ ^^^ same as the original -L. 

Next, we discuss another example. The transfer function generating the data is 



H(.) = 



1 

5s + 6 



s -6 

1 s + 5 



(4.19) 



As tangential interpolation data we have 



;fe = 2, (l,r,w) 



'r 



1 


1 


)•(- 


0' 

1 


1 


-6 

_5 + 2/_ 


'5 + 5/ 


'2+10/ 



h = 2,{ii,£,y) 





T 



' -1 


i 
6 


') 


'5-5( 



-2i, 



Tangential data leads to the following matrices 



1 



2- 10/ 



(4.20) 



5-2/ 



(4.21) 



A: 



W 



2,7' ^ 

i -6 - 

5+5/ 2+lOi 

1 5+2i 

5+5/ 2+10/. 



-2i 



R 



,V = 



-1 -6 

5-5/ 5-5/ 

1 5-2/ 

2-IO1 2-10/ 
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L= ^iS . '''_£' , <tL= ^l\ «^«' . (4.22) 

.260 260 ' 104 J L65 ' 65' 52 

Since the lemma's assumptions are satisfied, we can write a realization as 



E, 



10 130 

-17 7_ • _ 

260 260 ' 



104 



K = - 



10 
L 65 ^ 65 



.18 ,12- 
65 ^ 65 ' 
15 

52 



(4.23) 



B. 



-i -6 

5-5i 5-5i 
1 5-2i 

2 



2-10( 



lOi 



C. 



i -6 

5+5i 2+10/ 

1 5+2i 

5+5i 2+10i 



(4.24) 



The transfer function is as expected: 



flr{s) ^Cr{s^r-Kr% 



1 



5s + 6 



s -6 

1 s + 5 



(4.25) 



Some remarks: 
• Instead of the matrix obtained by evaluating the transfer function at 5 = /, 



H(0 



P+5i+6 P+5i+6 

1 i+5 

?+5i+6 P+5i+6 



I -6 

5i+5 5(+5 

1 1+5 

5i+5 5(+5 J 



which one would use in matrix interpolation, tangential data consists only of the 
vector Wi = H(/)ri, which in our case is the first column of H(;), since ri is the 
first identity vector. A similar observation holds for W2 and the left data Vi, V2. 
• In the realization [E,., Ar,Br, Cr,Dj, the matrices have complex entries, so 
using the same matrix data and considering the tangential data as 



k^2, (i,r,w) 



r 




1 


1 


H- 


'1' 



1 


— / 
1 


'5 + 5/ 


'5-5/ 



(4.27) 



h^2,{^,e,\) 



= 2/, 



1 




2i 



10/ 



-2/, 



10/ 



-2/ 
-6 



(4.28) 
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(i.e., points and their conjugates on the same side) leads to the reaUzation 



E, 



-L = 



B,=V = 



65 65 

3,2; 



65 ' 65 



26' 



3_ 
26 



2__ 
65 

_± 
65 



26' 



26 26' 26 26' 





, A,= 


-(tL = - 


" 37 3 
130 130 
^ , J_ 

.130^ 130 


, c, = w = 


10^ lO' 
1 1 ■ 


Io~Io' 

1.1- 






.To~To' 


10^ lO'. 



21 
130 
37 
130 



130' 



130' 



D, = 02. 



After applying a coordinate change (see Sect. 4.4 for details): 11 = 4= 
a realization with real entries is obtained as: 



1 -;■ 
1 J 



E, = - L = 



B, = V = 





1 
65 


4 " 
65 




8 
65 


7 
65. 


486 
1787 


460 1 
2819 


473 


1458 


S 


696 


1787 J 



Ar = — (TL = 



c, = w = 





29 1 
65 65 




2 8 
.65 65 


197 


197 


1393 


1393 


197 
1393 


197 
1393 J 



D, = 



The associated transfer function is as in (4.25), which is what we expected. 
The tangential directions in (4.27) and (4.28) are equal, i.e., rj — r2, 
€1=^2= Fi^, but this still allows us to identify the system (see Sect. 4.3.4 for 
the theoretical reasoning). 



4.3.4 System Identification 



In this section, we show that one is able to identify the underlying system given the 
right number of measurements. Assume that our system has McMillan degree «, 
that the number of inputs and outputs is equal (m = p) and that the underlying 
D matrix is full rank. Thus, we are looking for a realization of the system of order 
n which is minimal (both controllable and observable). 

Let us begin our argument by noting that, even though our realizations have 
D = 0, we can still identify systems with a non-zero D. This is due to the fact that the 
D-term can be incorporated either in the B or the C matrices. Assume a realization of 
the system as [E, A, B, C, D] with E, A invertible. The following realization 



E: 



E 



, A = 


A 
I 


, B = 


B 
I 



C = [C D], D = 



(4.29) 



corresponds to a system of order n + p (i.e., both A and E are of dimension 
(n + p) X (n + p)), but a zero D-term. One can easily check that the transfer 
function of the system in the new realization is as before. 
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Next, we show that when samples atn+p left tangential data and n + p right 
tangential data are provided, and the conditions in Lemma 1 are satisfied, we have 
recovered the underlying system. 

The number of free parameters for a rational matrix function H(.j) £ O'^'' with 
D 7^ and McMillan degree n is 2np + 2p , which is obtained by considering the 
control canonical form [18] of the system. Thus, we can freely choose n entries on 
p rows of A, p entries on p rows of B, np entries of C, as well as the p entries of D. 
On the other hand, the number of free parameters in tangential interpolation are 
2(n + p) for Aj, fij,i, j — I, . . .,n + p, as well as the tangential directions 

Tj eC'"'\i^ l,...,n+p and £j £ C^""'', j = I, . . .,n+ p. Note that once the 
sampling points 1, and fij and the sampling directions r, and Ij were chosen, the 
left and right data W; and Vj are obtained as H(/l,)r; = W; and £jH{fij) = Vj, 
respectively. This gives a total number of free parameters of 2(« + p) + 
p(n + p) + (n + p)p — 2n + 2p + 2np + 2p . Thus, the number of the free 
parameters needed to uniquely define the transfer function matrix is smaller than 
the number of unknown parameters when n + p tangential measurements are 
chosen on the left and on the right. 

In the following, we show that the Loewner matrix constructed from tangential 
data still has a system theoretic interpretation in terms of tangential controllability 
and observability matrices. A similar expression exists for the shifted Loewner 
matrix. Given n + p left interpolation data Vj = £/H(/i,), j = I, . . .,n + p, and the 
same number of right interpolation data w, = H(/!,,)r,-, i — 1, . . ., n + p, we have 
that 

^ _ v^r,- - £yw,- _ iMfii)ri - ^;H(;,)r,- 

fij - ;.,■ fij - Ai 

C{^jE - A)"'B + d - C(;./E - A)"'B - D 
■' t^j - h 

(».E-A)"' -(;.,■£ -A)"' 

\-h - ^■'■ 

= -£,-C(^(,-E - A)"'e(a,-E - A)"'Br;, ;, / = 1, . . .,m + /7. 
Similarly, 

U;V,T,- — hiiS^i Uiil^iE — A)"' — /t,(l,E — A)"' 
alLjj = ^-^-^ p^ = ijC^-^^ A^ —Br; + ijDri 

jXj — Aj fij — A, 

/i;C(A(/E - A)"'B + fi:D - liCUiE - A)"'B - 1/D 
_ « _j J J _ J. . 

flj - Ai 

= -ijCii-ijE - A)"'A(i,-E - A)"'Br; + ^yDr;, ;, i=l,...,n+p. 



r, 
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Thus 



L: 



£iC(//iE-A)"' 
_4+j.C(//„+pE- A) 



E[(;.iE-A)-'Bri ... (i„+^E-A)"'Br„+p 



(4.30) 



ffL^ 



£iC(/(iE-A)"' 
4+/,C(/i„+pE- A)" 



a[(a|E-A) 'Bn ... (a„+,,E-A) 'Br„+^] 



+ LDR. 



(4.31) 



The pairs (E, A, B) and (C, E, A) are controllable and observable, respectively, so 
provided that the sampling directions are chosen appropriately, the rank of the 
Loewner matrix is precisely the rank of the underlying E matrix, while the rank of 
the shifted Loewner matrix is p more than the rank of the underlying A matrix, 
which is as expected from (4.29). Next we provide an outline on how to choose the 
sampling tangential directions: 



If D=0, then L = 
sampling direction 
Ti = r2 = . . . = r„+p. 



-CE.^ and crL — —(Sh.'M and we can choose the same 
for all sampling points: t\ — l^ — ■ ■ ■ = tn+p and 
This is because (E,A,B) is controllable, so by using a 



linear combination of the columns of B, one can still control the system. Sim- 
ilarly, as (C, E, A) is observable, choosing a linear combination of the rows of C 
still makes the system observable. Denoting £iC by c and Bri by b, we have that 



L: 



c(/iiE - A) 



c(/^„+„E- A) 



[(;.,E-A)-'b ... (;.„+^E-A)-'b] (4.32) 



and similarly for crL. 

The statement above does not hold when (A, E) has at least one multiple eigen- 
value with the same algebraic and geometric multiplicity (e.g., E = A = I), as 
one would need more than one vector b to make (E, A, b) controllable. Similarly, 
more than one vector c is required to make (c, E, A) observable. Hence, the 
sampling directions for each evaluating point should be different from the others. 
If D 7^ 0, (4.31) shows that the D-term cannot be recovered unless we choose at 
least p linearly independent vectors r, and £j. In particular, one can choose the 
sampling directions as unit vectors of dimension p (see Sect. 4.4). 
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Lemma 1 provides a realization of the system in terms of the Loewner matrix 
pencil together with the V and W matrices. Therefore, this method constructs a 
descriptor-form representation of an underlying dynamical system exclusively 
from the data, just by arranging it in a convenient form, with no further manip- 
ulations involved. 

Consider the following transfer function 



H(.)=. 



1 



5.S + 6 



s -6 

1 s + 5 



1 2 
3 4 



(4.33) 



Note that this time, D 7^ 0, so the purpose of this example is to show how the 
D term affects the realization. Suppose that we are given the matrix data at /, 2i, 3i, 
Ai and their complex conjugates (i.e., H(i), . . .,H(4i) and H(— /) = H(i), . . ., 
H(— 4/) = H(4/)). We construct tangential data as 

k = A,{Lr,yi) = {(;\ei,H(;>i), (-!',e,,H(-Oei), (3/,e2,H(3;)e2), ("3i,e2,H(-30e2)}, 
A = 4, (/^, i?, v) = { (2i, e/, e/H(20) , (-2/, e/, e/H(-20) , (4;, e2^ e2^H(40) , 
(-4i,e2^e2^H(-40)}, 

and obtain the following realization, after performing the appropriate basis change: 



E, 



-L= - 



1 

130 

± 

65 

1 

250 

4 

125 



A,. = — ffL = — 



B. 



31 
26 

1_ 

26 
149 
50 
_L 
25 



■ 159 
130 

J_ 

65 

743 

250 

2 
■125 

49 ■ 

26 

__15 
26 
203 

50 

2_ 

25 ■ 



_2_ 
65 
7 

130 

1 

125 

3 

125 

\_ 

130 
± 
65 

L 

250 

4_ 

125 





J_ _ 

13 
2 
325 
38 
975 
29 
13 



1291 

325 



325 



26 
_^ 
26 
19 
650 
83 
975 



13 
6 

325 
38 
325 



c, = w 



11 

10 


1 

10 


27 
13 


5 
13 


31 
10 


1 
10 


161 
39 


14 
39 



Dr = 0, 



The singular values of E;., which is the Loewner matrix up to a sign change, are 
j^ , (^ ,0,0, while those of A^ are ^^ , j^ , jj^ , 5^p)3, so indeed our realization 
has order 4 = 2 + 2 — n+p with A^ full rank and E^ of rank 2 = m. Moreover, 
the computed eigenvalues of the matrix pencil are —5 • 10'"^, 2 • 10^^, —3, —2. The 
first two correspond to poles at infinity, while the last two are precisely the poles of 
our original system (given by the roots of the equation s^ + 5s + 6 = 0). 
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4.4 Tangential Interpolation for Modeling Y-Parameters 

The data sets are assumed to contain P samples of the multi-port Y-parameters, 
Y''', at frequency points yo),-, for i = 1, ...,/". To obtain a real system, the condition 
H(i') = H(^) needs to be satisfied. Therefore, the Y-parameters at the complex 
conjugate values of the sample points — jco, should equal the complex conjugates 

of the measurements at joOj, namely Y . 

The right interpolation data can be chosen as 

~Ai = ycu,-, l,+i = -jwi] r,-, r,+i = r,- w/ = y''V,-, w,+i = wj" = Y ' r; j , 
i=l,3,...,P, 

where co, = 2nfj e R. The right sampling directions may be chosen as fi = r2 = ei € 

^'''^ , r3 = r4 = e2,. . ., r2p-i = r2,, = e^,, rip+i = r2p+2 = Ci , r2p+3 = r2p+4 = 62, . . ., 
where e„, denotes the m-th column of the identity matrix I^, m=l,...,p. Conse- 
quently, the right data are columns of Y*'' and Y*'' , e.g. Wi = Y* 'j' e C"" ' , W2 = Y* j' , 
W3 = Y: 2 , W4 = Y. 2 , etc. More compactly, 

A = diag [joji , -jcoi JCO3, -jco^, . . .Jcop-i, ~jcop-i], (4.35) 

R=[ri,r2 = ri,r3,r4 = r3,...,rp_i,rp = rp_i]eC''^'', (4.36) 

W = [wi, W2 = wT, W3, W4 — W3, . . .,wp_i, wp = wp_i] e C'^^, (4.37) 

The left interpolation data is constructed as 



=M+i , f^i+i = -M+i ; ^iJi+i = -^iS V; = ^■y('+" , Vi+i = V; = £i\ '^' 



i=l,3,...,P 



(4.38) 



The left sampling directions if may be chosen as rows of the identity matrix I^, 
so the left data are rows of the admittance parameters Y*'' and Y . Compactly, 

M == diag [;ffl2, -jco2jm4, -jm, ■ ■ -Jcop, -jojp] , (4.39) 

L* = [£* il q t^ ■■■ tp £*p], LeC^^'', (4.40) 

V* = [vi* vT* V3* V3"* ••• vp* v/^*], YeC''"'''. (4.41) 

Without loss of generality, we assumed that we have an even number of 
samples. After tangential data are constructed, the Loewner and shifted Loewner 
matrices are built according to formulas (4.15) and (4.16). The following change 
of basis needs to be performed to ensure that the matrix entries are real: 

A = fi*Afi, M = n*Mn, L = n*L, v = fi*v, r = Rn, w ^ wn l = n*Ln, 

ah — YVahtl, where 
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n = diag[n,...,n]ec^^^ n = i= 



1 -j 
1 j 



Note that n*n = nn* = h. 

When too many samples are available, the pencil obtained after using all 
measurements is singular. Hence, the assumption that the Loewner pencil is reg- 
ular does not hold since the realization given in Lemma 1 is not minimal. The 
underlying system can still be identified by plotting the singular values of the 
Loewner and shifted Loewner matrices and noticing where the largest drop in 
singular values occurs. Thus, the singular part can be projected out via a singular 
value decomposition: 

xL — crL = YZX, x G {j(Oi, —jOi} , ' = 1 , • ' ' : ^; 

where rank(xL — crL) =: A^ (the dimension of the regular part), \ £ <C ^^ and 
X e C'^^ . For strictly proper systems, N — n, while for proper systems 
N = n + rank(D). Using the singular vectors as projectors, the realization is given 
as 

E = -Y*LX, A = -YVLX, B = Y*V, C = WX, D = 0. (4.43) 

This approach is costly for sets with a large number of samples k, as the 
complexity of the SVD of xL — ah is 0(k ). The adaptive and recursive proce- 
dures presented in [20] are cheaper, but still identify when the pencil becomes 
singular. 

Remark Our algorithms are able to recover the underlying system, therefore, for 
data sets coming from real-world systems, the resulting models are stable (after 
extracting the necessary D-term). 



4.5 Numerical Results 

We investigate two examples arising in circuit simulation. The first is an infinite- 
dimensional delay system, for which we build a finite-dimensional, low-order 
model which captures the low-frequency behaviour. The second is an RC circuit 
with 70 ports for which we compare the tangential and the matrix interpolation 
approaches. 



4.5.1 Delay System 

Electrical circuits with transmission lines can be described by differential equa- 
tions containing delays. These expressions are obtained from the d'Alembert 
solution to the telegrapher's equations [23]. The delay factor turns out to be ^/LC. 
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In the following, we consider the system: 

Ex(f) = Aox(f) + Aix(f - t) + bu{t), y(f) = cx(f), (4.44) 

with E, Ao, Ai e K^™^5™ and t = 1. The associated transfer function is given by 



H(5) = c{sE - A,) - Aie"")"'b. 



(4.45) 



The matrices E, Aq and Ai are symmetric. Moreover, E is positive definite, 
while Ao and Ai are negative definite. Due to the time delay, this is an infinite- 
dimensional system. However, as the matrices E, Aq and Aj are diagonalized by 
the same transformation, the system can be decoupled into 500 independent 
equations and the poles of the system can be found by solving nonlinear equations 
of the type: 



sae - Aa„ -^-A,e " = 0, 



(4.46) 



where 1e is an eigenvalue of E, and aa„ and /Iai are eigenvalues of Aq and Ai, 
respectively. Setting s as x+j-y allows us to write (4.46) as a system of 
2 nonlinear equations corresponding to the real and imaginary part: 



xIe - M„ - /Ia,cos(tx) = 0, y?.E + ^a, sin(Ty) = 0. 



(4.47) 



Figure 4.1a shows poles with imaginary part between ±130. Each of the 500 
systems of equations of the form (4.47) were solved using the MATLAB function 
f solve with initial guess for x and different guesses for y, between —130 and 
130. Thus, for each value on the imaginary axis, there is one pole obtained from 
solving one of the systems of nonlinear equations. Even though Fig. 4.1a shows 
only poles with imaginary part between — 130 and 130, they extend well above that 
value. 

We are looking for a low-order finite-dimensional model as in (4.1) to 
approximate this infinite-dimensional system. Our approach consists of taking 
1000 measurements of the frequency response logarithmically distributed between 
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Fig. 4.1 Poles and frequency response of the delay system: a Poles b Frequency response 
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10 



■ and 10 ■ and using this information together with the theoretical concepts 



described in Sect. 4.3 and the numerical implementation presented in Sect. 4.4 to 
construct such a reduced model. We also employ state-of-the-art vector fitting in 
our comparison. The accuracy was assessed using two error measures: the nor- 
malized ,5f oo-norm, 



^fx! error — 



max,=K...i|r('' - HmodeiC/ • 27t/;-)| 



maxj^i . 



lyWl 



which evaluates the maximum deviation in the singular values and the normalized 
Jf 2-norm of the error system, 



Jf^^2 error = 



\ 



ELi|y'''-Hn,odeia-27r/;)P 

Ell I >-("!' 



which evaluates the error in the magnitude of all entries, proving to be a good 
estimate of the overall performance. 

Let us begin our discussion with the analysis of the singular value drop of the 
matrix pencil ((tL,L) constructed using all measurements. This is shown in 
Fig. 4.2. We notice that the rank of the pencil (crL, L) constructed using these 
measurements is about 700. Since the system is infinite-dimensional, one may 
expect the matrix pencil to be full rank. The fact that it has a numerical rank of 
700, instead of 1000, is because of the small magnitude of the response and small 
oscillation peaks at high frequencies, which can essentially be approximated by a 
line. 

We build models of order n = 41. Results of the comparison of our approach 
against vector fitting are shown in Table 4.1, while Fig. 4.3 presents plots of the 
errors in absolute value between the original system and the different reduced 
models. 



Fig. 4.2 Drop of the singular 
values of the matrix pencil 
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Table 4.1 Results for n = 


= 41 order models of an infinite-dimensional delay system 


Algorithm 


Iterations M'rx, error .3^2 error 


SVD Approach 
Vector Fitting 


N/A 1.4995e-002 3.5498e-002 
50 3.4159e-002 5.4146e-002 
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Fig. 4.3 Models obtained with our approach and with vector fitting: a SVD approach b VF 
(50 iterations) 
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Fig. 4.4 ,yfrx, and .i^'? errors for our approach compared to vector fitting: a .J^^^ error b .J^i 
error 



Figure 4.4a compares the ^oo error between all our models with orders 
between n — \ and n — 5Q and the same order model built with vector fitting (in 
50 iterations). Similarly, Fig. 4.4b shows the corresponding .yf 2 errors. We notice 
that, in most cases, the errors for our models are lower than the VF models. 
Moreover, in Fig. 4.4a, we observe that VF reaches a plateau, in the sense that the 
errors stay almost constant with increasing the model order. The opposite 
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behaviour happens for our models, as they become better when we increase the 
desired dimension. 

Since vector fitting is solving an optimization problem, its performance greatly 
depends on the number of iterations. Figure 4.5 shows the decay of the .^^c and 
Jf 2 errors as a function of the number of iterations performed (between 1 and 50) 
when the procedure starts with 41 starting poles. The errors stagnate after about 20 
iterations. 



4.5.2 Circuit with p = 70 Ports 



We analyze a dynamical system which describes a linear subcircuit of a large post- 
layout circuit (i.e. with parasitics due to layout). It was provided by Qimonda AG 
under the name rc549 and has previously been investigated in [25] from the point 
of view of terminal reduction. One of the approaches currently used for model 
order reduction of systems with a large number of terminals is terminal reduction 
[9, 10]. This system has 141 poles: 84 infinite, 56 real between —1.9 • lO'^ and 
— 1.1 • 10'^ and one at —2 • 10"'. The poles are depicted in Fig. 4.6b and the 
frequency response of this system is shown in Fig. 4.6a. 

Our goal is to compare the tangential approach to the matrix interpolation 
approach on this example arising in circuit simulation. For tangential interpolation, 
we take P — 400 measurements of the transfer function logarithmically distributed 
between lO' and 10^° and, for each measurement, we select one sampling direction 
as one of the unit vectors of dimension p. Thus, for each frequency measurement, 
we use either a column (as right data) or a row (as left data) of the matrix 
measurement. The resulting Loewner and shifted Loewner matrices are of 
dimension 400 and a plot of their normalized singular values is shown in Fig. 4.7a. 
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For matrix interpolation, we take P — 6 measurements of the transfer function 
logarithmically distributed between lO'' and 10^° and, for each measurement, we 
select p sampling directions as p unit vectors (all columns and rows of the identity 
matrix of dimension p x p are used as right directions and left directions, 
respectively). Thus, for each frequency measurement, we use the entire 
p X p matrix measurement. The resulting Loewner and shifted Loewner matrices 
are of dimension 6 x 70 = 420 and a plot of their normalized singular values is 
also shown in Fig. 4.7a. We notice that both methods yield Loewner and shifted 
Loewner matrices of the same rank, which suggest that the underlying system can 
be reduced to order n = 127. 



4 Model Order Reduction with Applications to Circuit Simulation 



105 



Table 4.2 Results for the 
tangential and matrix 
interpolation approaches 



Approach 



•J^n. 



J^2 error 



Tangential interpolation 
Matrix interpolation 



3.3422e-010 
6.9040e-009 



3.3589e-010 
6.9056e-009 



Figure 4.7b shows the finite poles of the original system, together with the poles 
of the reduced systems obtained with the tangential and the matrix interpolation 
approaches. We notice that all poles were recovered. In particular, we notice that 
the reduced systems of order n — 121 with D = contain 57 finite poles (the same 
as the original one), and 70 infinite poles which hide an underlying non-zero 
D-term, as in (4.29). Hence, all redundant poles have been eliminated.. 

Similarly to the scalar case (the delay system), we use the following error 
measures to asses the accuracy of the reduced models with respect to the 
measurements: 



J^^ error : 



max,-^i.„P (7i (HmodeiO • 2nfi) - Y*'^ 
max,=i„.pffi(YW) 



J^2 error = 



\ 



Ell H.™dei(/--27i/;-)"Y« 



V* IIyWI 



where cr i ( • ) stands for the largest singular value of the matrix ( • ) and 1 1 • 1 1 ^ stands for 
the Frobenius-norm, which computes the sum in the magnitude squared of all 
entries. For devices with many ports, computing the error in each entry is unfeasible, 
as forp = 70 ports, the errors in all 70~ — 4900 entries would have to be assessed. 
Instead, these error measures give a good indication of the model's quality. 

Table 4.2 shows that both approaches yield small errors, with the tangential 
interpolation approach yielding slightly smaller errors than matrix interpolation. 

Figure 4.8 shows the frequency response (sigma plot) of the top left and bottom 
right 10 X 10 entries of the original system, together with the reduced systems 
obtained via the two approaches: tangential and matrix interpolation. 

The purpose of this example is to show that tangential interpolation yields 
comparable results to matrix interpolation provided that the sampling directions 
are chosen as outlined in Sect. 4.3.4. Moreover, both methods are able to eliminate 
the redundancy and produce reduced models which are very close to the original. 



4.6 Conclusion 



This paper discusses topics in model order reduction related to accurate macro- 
modeling from frequency-domain data. In particular, the system theoretic concept 
of the Loewner matrix pencil constructed in the framework of tangential 
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interpolation is analyzed from a theoretical point of view, by showing that a 
system can be recovered from tangential interpolation data given that the right 
amount of measurements are available and the sampling directions are chosen 
appropriately. We applied this concept to two numerical examples arising in cir- 
cuit simulation: a delay system and a subcircuit with a large number of ports. 
Several other numerical examples which support our approach are presented in 
[19, 20]. 
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Chapter 5 

Forward and Reverse Modeling of Low 

Noise Amplifiers Based on Circuit 

Simulations 



Luciano De Tommasi, Joost Rommes, Theo Beelen, Marcel Sevat, Jeroen A. 
Croon and Tom Dhaene 



Abstract Forward and reverse modeling of RF circuit blocks are useful approa- 
ches in design space exploration. The underlying idea of forward modeling is the 
creation of accurate surrogate models, which can be used to predict the circuit 
performances replacing (expensive) circuit simulations. On the other hand, reverse 
modeling concerns multiobjective optimization to explore relevant trade-offs 
between performances. This paper provides a discussion of application of surro- 
gate models and multiobjective optimization to narrow-band low noise amplifier 
design. We discuss numerical difficulties encountered when the forward model is 
derived by using surrogate models of low noise amplifier admittances to compute 
performance figures via analytical equations. Afterward, we provide an example 
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where direct performace modeling leads to a more accurate result even when the 
simplest surrogate model type (a lookup table) is used. Finally, a detailed tutorial 
of the normal boundary intersection optimization method is provided. 

Keywords Response surface modelling • Surrogate modelling • Forward mod- 
elling • Reverse modelling • Low noise amplifier • Design space exploration • 
Multiobjective optimization • Pareto front 



5.1 Introduction 

An RF circuit block, such as a Low Noise Amplifier (LNA), shown in Fig. 5.1, is 
characterized by performance figures (e.g., voltage gain, linearity, noise figure, 
etc.) which are functions of the design parameters (e.g., width and length of 
transistors, bias conditions, values of passive components) [10]. The goal of the 
design process is to figure out one or more sets of design parameters resulting in a 
circuit which fulfills the specifications, i.e., constraints given on the performances. 

Nowadays most of the RF integrated circuit design methods are still based on 
experience. Typically, circuit designers start guessing one or more solutions of the 
design problem, possibly relying on simplified mathematical models reflecting 
coarse approximations of the physics describing the circuit. Afterwards, they 
verify the circuit behavior through circuit simulations, which instead exploit 
highly accurate models of electronic components. Design parameters are then 
adjusted until circuit specifications are achieved. Hence, there is a demand from 
industry for the development of systematic methods and tools, which fill the gap 
between the generality of several existing modeling and optimization software and 
the concrete applications at hand. 

The design problem can be formulated as a single objective optimization 
problem, where a single performance or a weighted sum of performances is 
minimized. For example, typical LNA design strategies try to minimize noise 
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figure (NF) (e.g. [12]) or power dissipation or to maximize linearity, while 
enforcing proper constraints on the remaining performances. 

A more insightful and systematic approach than single objective optimization, 
even though computationally more expensive, is multi-objective optimization. 
Circuit performances normally are in conflict with each other: for instance, to 
improve the voltage gain of an LNA, the linearity has to be reduced. Hence, a 
multiobjective optimization method (e.g. the normal boundary intersection algo- 
rithm (NBI) [3, 16] or the strength pareto evolutionary algorithm 2 (SPEA2) [15, 
20]) can be used to figure out and graphically display such trade-offs as Pareto 
fronts. Since the sample points belonging to a Pareto front are obtained by opti- 
mizing performances with respect to design parameters, the result of such multi- 
objective optimization is a 'reverse model' of the circuit block, which determines a 
correspondence between samples of the Pareto front and samples belonging to the 
design space. 

Design tools based on optimization methods can be classified in two categories: 
simulation-based tools and stand-alone tools. To compute circuit performances in 
the optimization loop, the former category of tools calls a commercially available 
circuit simulator, such as Spectre or Hspice. The second category includes a built- 
in performance evaluation module based on a symbolic performance model of the 
circuit [11, 14, 18]. Although symbolic models enable a faster evaluation of 
performances than circuit simulations, their accuracy is lower. For this reason, the 
authors of [18] propose to use their tool as an "auxiliary tool generating an initial 
design for a commercial design environment". On the other hand, an extensive 
exploration of the design space with simulation-based tools is limited by the high 
computational cost of numerical simulations. 

In this work, a simulation-based methodology, aiming to combine accuracy of 
circuit simulations with computational efficiency of symbolic models, is described. 
Circuit simulations are replaced by a cheap-to-evaluate surrogate model, derived 
from a proper set of precomputed simulations [ 6, 7, 14, 19]. An approximation of 
the circuit performance figures based on one or more surrogate models is referred 
to as 'forward model' of the circuit. A forward model can be either obtained via 
direct modeling of circuit performances or by using surrogate models of a con- 
venient set of behavioural parameters (e.g. admittances and noise functions) to 
compute performances via analytical equations. The former approach involves the 
direct approximation of the function: 

P = p(x,y), (5.1) 

where p — {pi ■ ■ ■ Pj) is the vector of circuit performances, x = (xi • • -x^) is the 
vector of design parameters and y = (yi • • • yi) is a vector holding extra-circuit 
parameters (e.g. load impedances). Surrogate modeling involves the identification 
of a vector of functions: 

P = p(x,y), (5.2) 

such that p « p. 
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Circuit performances can also be written as functions of the behavioural 
parameters b = b(x) and the extra-parameters y : 

p = p(b(x),y). (5.3) 

The latter modeling approach consists in the approximation of the behavioural 
parameters b = b(x) through the surrogate models b = b(x), such that b « b. The 
forward model is then given by: 

p = p(b(x),y). (5.4) 

Modeling of behavioural parameters is attractive because they do not depend on 
the extra-parameters y, therefore the approximation problem is simplified. How- 
ever, in this paper we show that forward modeling from behavioural parameters is 
likely to introduce an unacceptable amplification of the numerical error, i.e. b « b 
does not necessarily imply that p « p. Therefore, direct approximation of per- 
formances has to be currently preferred, even though the inclusion of all circuit 
parameters in the model is still an open problem. 

Methods described in this paper have been implemented into a prototype of an 
auxiliary design tool. Currently, it can only handle narrowband LNA design and 
does disregard circuit layout optimization. Further extensions can enable wideband 
and multiband LNAs design [5, 12], and layout-level optimization [13]. 

The paper is organized as follows. In Sect. 5.2, forward and reverse modeling 
problem descriptions are introduced. Section 5.3 focuses on forward modeling and 
the numerical error amplification problem. Section 5.4 describes reverse modeling 
via NBI optimization technique. Section 5.5 explains how a reverse model can 
exploit a forward model to speed-up the computation of a Pareto-front. Section 5.6 
discusses presented results and concludes the paper. 



5.2 Forward and Reverse Modeling: Problem Descriptions 

A forward model of a circuit determines a correspondence between design 
parameters and its performances. It involves the generation of a set of surrogate 
models or response surface models (RSMs), which are cheaper to evaluate than the 
set of circuit simulations needed for evaluating the same performances. On the 
other hand, a reverse model links Pareto-optimal points of the performance space 
with the corresponding design parameters. 

A block diagram showing the connection between circuit simulations, surrogate 
models, design parameters and performances is depicted in Fig. 5.2. Computa- 
tional times are indicative. In particular, the computational time needed to generate 
the forward model depends on the specific set of functions that are approximated 
via surrogate modeling. 

A typical setup used to build a forward model of an RF circuit block consists of 
modeling software (e.g. the SUMO Toolbox [17]) linked to a circuit simulator. The 



5 Forward and Reverse Modeling of Low Noise Amplifiers 



115 



Fig. 5.2 Block diagram 
showing the connection 
between circuit simulations, 
surrogate models, design 
parameters and performances 






Simulation 
3 mm 




• Design Optimization 
may require > Ik 
simulations 

• RSM in library 



^ 




W 



Evaluation 
1 ms 



Ls 



I 



Gain, Nose, Admittgnoe, , 



modeling software directly drives the circuit simulator, requesting new simulations 
in order to accurately cover the entire design space. Additional data samples may be 
selected according to adaptive sampling techniques, to allocate more samples in 
those regions of the design space which are more difficult to model and/or where the 
error is the largest [1]. This way, the obtained model is intended to be valid 
throughout the entire design space. This approach was adopted in [6, 7]. 

Several design problems can be formulated as multiobjective optimization 
problems, involving the simultaneous optimization of two or more conflicting 
objectives subject to constraints. For instance, for an LNA one wants to maximize 
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linearity IIP2 but at the same time, maximize the voltage gain A^. as well. Since 
these are two conflicting objectives, the trade-off has to be studied. One way to do 
this is to compute a Pareto-front. An example is given in Fig. 5.3. 

The final result of such optimization process is a reverse model, which provides 
the design parameters corresponding to each Pareto-optimal solution. In the 
example shown in Fig. 5.3, the left side plot shows the points of the design space 
(W„, Lmn)' which correspond to the Pareto-optimal points of the right side plot. 

Since the construction of a reverse model requires a lot of performance eval- 
uations, it could exploit a forward model in order to make the computation of 
Pareto-front fast. This will be shown in Sect. 5.4. 



5.3 Forward Modeling 

Forward modeling of RF circuit blocks is a challenging research area, since it 
demands to obtain accurate models with a small number of data samples distrib- 
uted in high dimensional design spaces. For instance, a forward model of an LNA 
should provide the following performance figures: power consumption P, voltage 
gain Ay, input reflection Fq, second-order linearity IIP2, third-order linearity //P3 
and noise figure NF, as functions of the most important design parameters. 

In [7] the authors used the SUMO Toolbox [17] to generate surrogate models of 
LNA behavioral parameters (admittances and noise functions). Such behavioral 
parameters were given by a simplified analytic model, in order to allow a fast 
model identification. It was tried to achieve a Root Relative Square Error (RRSE) 
< 0.05 with a maximum number of samples of 1500. Results indicated that this is 
possible only including the first four parameters as 'inputs' of the model W, L^,/, 
L (when V(js and L,„ are kept constant). 

A study reported in [4] considers a complete characterization of an LNA pro- 
vided by circuit simulations. This includes the complete set of admittances needed 
to describe the weakly non- linear behavior [2], which were not taken into account 
in [7], but the number of inputs was limited to two (W and L^). Results indicate that 
a greater number of samples is needed to achieve a similar level of accuracy as 
using the analytic model. 

In order to build a forward model, surrogate models of admittances and noise 
functions have to be used to compute performances (P, A,,, r„, IIP2, IIP3 and NF), 
which are expressed as analytical functions of admittances and noise functions.^ 
This means that errors in the surrogate models may be amplified and hence result 
in inaccurate performance models (a general discussion of this issue is provided in 



' The subscript 'n' indicates that design variables Wand L„, have been normalized such that they 
vary between — 1 and 1 . 

" Admittances and noise functions are the elements of the vector b introducted in Sect. 5.1. 
Zj., Z/ and L„ are elements of the vector y introduced in Sect. 5.1. 
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Fig. 5.4 IIP2 as function of W and L, (computed by means of surrogate models) 

the introduction). On the other hand, direct modehng of performances would 
require the inclusion of the source and load impedances, respectively Z^ and Z/, as 
additional inputs (Fig. 5.1).^ This would make the modeling more difficult. With 
the computation of performances from admittances and noise functions, the load 
conditions enter into the analytical expression and do not need to be modeled. 



5.3.1 Performance Figures via Surrogate Models 

Forward modeling from behavioural parameters is prone to introduce an unac- 
ceptable amplification of the surrogate model numerical error when computing 
circuit performances from behavioural parameter models. This is demonstrated by 
the following example. 

Performance figures are expected to be smooth functions, 'local' spikes are not 
expected. We computed IIP2 with respect to the design parameters W and L„ by 
exploiting the models computed in [9] (Fig. 5.4). It is seen that several spikes 
appear. IIP2 can be expressed as function of the ratio of two voltages V2i/v2p'- 



IIP! = lOlogio 



1000 
"8^ 



(5.5) 



where V21 is the voltage amplitude of the 'wanted' signal at output and V2p is the 
voltage amplitude of 'unwanted' second harmonic at output [10]. It can be verified 
that all spikes are due to V2p and l/v2p, being V21 a smooth function. 



Furthermore, by decomposing V2p into numerator and denominator V2p 



den 



can be verified that, being den a smooth function, all the peaks are due to num and 
\lnum (Fig. 5.5). 



IIP! is expressed in dBm, i.e. logarithmic scale. 
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In particular, 'positive' peaks of IIP2 are due to peaks of — L. They appear when 
num assumes values close to zero. On the other hand, negative peaks of IIP2 are 
due to peaks of numf" 

The origin of 'negative' peaks is easy to investigate, since the function num 
may be readily decomposed into a sum of several terms, which are functions of 
surrogate models. A complete analysis is out of scope of this paper, we just 
provide a single example. A subpart of num, num' , is proportional to the product of 
two admittance surrogate models: num' oc yioioopypOOO/jO- The model of yioioop has 
a small peak due to modeling errors, which is amplified when multiplied by 
ypOQOpo, since it is located in the region of the main resonance of jpooopo (Fig- 5.6). 
This peak is reflected also onto IIP2. 



5.4 Reverse Modeling with the NBI Method 



As indicated in the introduction, a natural way to deal with trade-offs between 
performances is to use a suitable multiobjective optimization technique. 

Multiobjective optimization algorithms can be categorized globally into 
deterministic and evolutionary methods. Examples of such methods are NBI 
method and SPEA2, respectively. In this paper, we focus on the NBI method [3]. 

To outline this method we have to introduce a few notions in a more formal 
mathematical setting. The design parameters and performance parameters are 
assumed to be in the design space S' and performance space ^, respectively. We 
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assume that the region @ is feasible, i.e., all design parameters in 3) satisfy the 
imposed constraints: 

^ = {x e ^"1 c(x) < 0} with c(x) € m. (5.6) 

The performance space 0* contains all performance values resulting from feasible 
design values in the region Qi after applying a performance function f. Further- 
more, the performance values can also be constrained, i.e., 

,^ = {f e 3t\ ^^i - f(x),g(f) <0}. (5.7) 

The general problem at hand is the minimization of several objectives simulta- 
neously; i.e.. 



minimize f(x) 



such that g(f) < 0. (5.8) 



However, in general there is no single x G S* that minimizes all components/), i = 
!,...,« simultaneously. Consequently, the multi-objective optimization (5.8) leads 
to trade-off situations where it is only possible to improve one performance at the cost 
of another. A trade-off situation needs to be based mathematically on a so-called 
dominance relation between vectors (see for example [16]). 

Definition 1 Let a and b be vectors in ^". Vector a = (ai, . . .,«„) is said to 
dominate vector b = {bi, . . .,b„) if and only if 

a ^ b :^ V,-g{i,...,„}(a,- <Z7;) A 3,-g{i_.,.,„}(fl;<fe,-)- (5-9) 

With this definition, a performance vector/* f* is said to be Pareto-optimal if it 
is nondominated within 3P, i.e., 

-3fe,/[f^r]. (5.10) 

The set of all Pareto-optimal points is called the Pareto front of 0*. The set of 
points in the design space corresponding to the Pareto front are called the source of 
the Pareto front. 

In literature there are several methods available to calculate the Pareto front. In 
this paper we will describe the commonly used Normal-Boundary Intersection 
method (NBI method [3, 16]). We will closely follow [16]. 

For simplicity reasons we describe the NBI method for the 2D case only. Thus, 
we assume the design space 2 and performance space 3P as given above (with 
m — n — 2). However, it can easily be extended to higher dimensions. 

Description of the NBI algorithm 



Maximization of a performance p is equivalent to minimization of —p. 
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1. Start with individually minimizing/] and/o as a function of x, i.e., compute 
f**^ = f(x**) = min{fk(x)| X G S'},k = 1,2. 

2. Determine the straight line if in ^ between these points f' and f^. 

3. Determine the vector n normal to this straight line, pointing into the direction of 
decreasing f. 

4. Select A^ points f^. on the line ^ between f*' and f*' : f^ = l^f*' + (1 - 4)r^ 
for some < It < 1 . 

5. For each point f^-, find the point p^. that maximizes the distance along n in the 
direction of decreasing f, and starting in f^-. 

6. The points p^^, 1 < ^ < A'^ are on the Pareto front. 

Notice that in fact the NBI method consists of A^ subproblems in which a ID 
line search optimization problem has to be solved. Clearly, several modifications 
to the above approach can be made to speed up the calculations. However, we will 
not go into details here. 



5.5 Reverse Modeling Using Transistor Level Simulations 

The NBI algorithm described in Sect. 5.4 was tested by using a simplified ana- 
lytical model of an LNA [8]. However, to obtain an accurate representation of 
trade-offs between competing performances, a transistor level simulator has to be 
used. In principle, NBI can directly drive a circuit simulator by requiring data 
samples needed in the optimization process. However, this is nearly infeasible in 
practice, because of the large amount of simulations needed (a typical order of 
magnitude is 10"*, with a cost of about 2-3 minutes per simulation). A more 
applicable idea is the usage of a forward model as described in Sect. 5.3. However, 
the final accuracy of the Pareto-front may be poor even when surrogate models of 
behavioral parameters are fairly accurate, due to the error amplification process 
described in Sect. 5.3.1. 

A better solution relies on the direct approximation of circuit performances, as 
explained in the introduction. With this approach, no error propagation from the 
behavioral parameter space to the performance space occurs. A simple precom- 
puted lookup table, which is linearly interpolated when NBI requests a new per- 
formance evaluation, will be used to demonstrate the validity of this approach. 

Despite of its simplicity, linear interpolation is a robust way to approximate a 
function, which is doable even in high dimensional spaces. Moreover, a lookup 
table is generated at the only cost of circuit simulations. There is not additional 
cost due to optimization of surrogate model parameters (e.g. training of a neural 
network and number of layers and neurons per layer optimization [4, 6, 7, 8]). 
Linear interpolation occurs when a new sample of the performance has to be used 
in the Pareto front computation and not in an earlier stage. 
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Fig. 5.7 Comparison between Pareto-fronts obtained with a lookup table and with a forward 
model (trade-off between voltage gain Ay and linearity IIP2 of an LNA). Left: design space. 
Right: performance space 

5.5.1 Numerical Results 



In this section we show an example comparing forward model and lookup table 
methods to generate a Pareto-front (Fig. 5.7) with the NBI method. Similarly to 
Fig. 5.3, we show the trade-off between voltage gain A,, and linearity IIP2 with 
respect to the (normalized) design parameters W,, and L„,„ (other design parameters 
are kept constant to convenient values). However, we remark that results in 
Fig. 5.3 were generated by relying on a simplified LNA analytical model [7], 
whereas here both forward model and lookup table are based on transistor-level 
circuit simulations. Details about surrogate models exploited in performance 
computations are given in [4]. The lookup table is a regular grid of 100 x 100 



It is worthwhile to note that only 100 circuit simulations corresponding to different values of 
W need to be performed to generate the table, since the variability of L,„ is taken into account in 
the analytical performance equations, being L,„ an element of the vector y introduced in Sect. 5.1. 
Specific LNA performance equations are not included in this paper, because they can be found in 
any relevant text-book. 
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points. However, it is evident that the number of circuit simulations for each 
design parameter taken into account, has to be reduced as the dimensionality of the 
considered design space is increased. 

It is readily seen from Fig. 5.3 that the forward model based on surrogate 
models of admittance function, returns a 'sparse' set of points which fail to 
approximate the Pareto-front. Indeed such inaccurate results could have been 
expected from the discussion developed in Sect. 5.3.1. On the other hand, the 
forward model based on the performance lookup table provides a reasonable front. 

Finally, we note that in principle the error amplification effect discussed in Sect. 
5.3.1, could be negligible if surrogate models had an error sufficiently lower than 
ones in [4]. However, in practice this is not easy to achieve. Moreover, how much 
accurate the models should be, and in which part of design space, are still open 
issues. 



5.6 Discussion and Conclusions 

This paper has reviewed forward and reverse modeling concepts of RF circuit 
blocks, referring to a narrowband LNA. 
A number of conclusions can be drawn: 

• In principle forward models can be exploited to speed-up the computation of 
Pareto-fronts (i.e. reverse modeling), but the accuracy can be limited by error 
amplification from the surrogate models of behavioral parameters (admittances 
and noise functions) to circuit performances. 

Direct modeling of circuit performances solves the problem of error amplifi- 
cation from behavioural parameters to performances (Sect. 5.3.1). 

• NBI optimization is proven as a promising approach to investigate LNA per- 
formance trade-offs. 

• Forward models based on performance lookup tables (and linear interpolation) 
are likely to be sufficient for obtaining accurate Pareto-fronts when only two 
design parameters are taken into account in the forward model. 

However, an LNA design problem may have ten or more design variables. 
Therefore, possible limitations of the uniformly filled lookup tables are: 

• Uniform sampling of performances would require a too large number of samples 
because of the oversampling of the smooth regions, where sensitivity of per- 
formances as functions of design parameters is modest. It is expected that 
adaptive sampling techniques are needed when more design parameters are 
considered [1, 4, 17]. 

• Interpolation of the resulting large lookup tables can be computationally less 
efficient than modeling such a large number of design variables (which on the 
other hand is technically much more challenging). 

Some important points have to be investigated further: 
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• How does the lookup table approach scale to more design parameters? 

• How can the lookup table be filled and processed in an efficient and accurate 
way? 

• How accurate should a forward model be in order to be successfully used in 
reverse modeling? 

Finally, the surrogate modeling approach implemented in [17] will be directly 
applied to performances, comparing the accuracy with the table-based method. 
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Chapter 6 

Recycling Krylov Subspaces for Solving 
Linear Systems with Successively 
Changing Right-hand Sides Arising 
in Model Reduction 

Peter Benner and Lihong Feng 



Abstract We discuss the numerical solution of successive linear systems of 
equations Ax ~ bj, i = 1,2, . . .,m,hy iterative methods based on recycling Krylov 
subspaces. We propose various recycling algorithms which are based on the 
generalized conjugate residual (GCR) method. The recycling algorithms reuse the 
descent vectors computed while solving the previous linear systems Ax — bj, 
j— 1,2,...,;— 1, such that a lot of computational work can be saved when 
solving the current system Ax = b,. The proposed algorithms are robust for solving 
sequences of linear systems arising in circuit simulation. Sequences of linear 
systems need to be solved, e.g., in model order reduction (MOR) for systems with 
many terminals. Numerical experiments illustrate the efficiency and robustness of 
the proposed method. 

Keywords Linear systems of equations • GCR • GMRES • Recycling Krylov 
subspace 



6.1 Introduction 

In some fields of circuit design, such as macromodeling of linear networks with 
many terminals [2], noise analysis of RF circuits [19] etc., successive linear sys- 
tems of equations A,x = bi,i = 1,2, . . .,m, have to be solved. Here the dimension 
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n of the unknown vector x G M" is very large. In general, the A,-, i — 1,2, . . .,m, 
form a set of different, nonsymmetric matrices. In some applications [2], only the 
right-hand sides bf vary while A, s A is fixed, which can and should be exploited. 
In this paper we focus on the latter case, i.e. 

Ax — bj, i—\,2,...,m. (6-1) 

If a direct method like a sparse LU or Cholesky decomposition is applicable, 
then the solution of the linear systems (6.1) can be obtained by sharing the 
LU/Cholesky decomposition of A and using successive forward/backward solves. 
Whether or not a sparse direct solver is applicable depends on the size of the 
matrix, its sparsity pattern and adjacency graph which is highly application- 
dependent. For details see [3]. Here, we are interested in cases where the coeffi- 
cient matrix A does not allow the efficient application of a sparse direct solver. In 
this situation, iterative methods are to be used. For this purpose, any standard 
solver can be applied, e.g., depending on the structure of A, the generalized 
conjugate residual (GCR) method, the generalized minimized residual (GMRES) 
method, the quasi minimal residual method (QMR), the preconditioned conjugate 
gradient (PCG) method are possible choices (see, e.g., [17] for hints on the choice 
of method). Doing this in a naive way, the chosen method is started anew for each 
right-hand side vector bj. Since the linear systems of equations in (6.1) share the 
same coefficient matrix A, we try to exploit the information derived from the 
previous system's solution to speed up the solution of the next system. Recycling 
is one possible choice here which turns out to be a suitable approach for the 
applications in MOR we have in mind. Due to a quite easy mechanism for recy- 
cling, we focus here on GCR. That is, the recycling algorithms proposed in this 
paper reuse the descent vectors in GCR for the solution of the previous systems as 
the descent vectors for the solution of the current system. This way, the compu- 
tation of many matrix-vector products can be saved as compared to restarting 
GCR for each subsystem. The algorithm has the same convergence property as the 
standard GCR method, that is: for each system, in exact arithmetic the approxi- 
mate solution computed at each iteration step minimizes the residual in the affine 
subspace comprised of initial guess and descent vectors. 

Note that very recently, other Krylov subspace methods have been suggested for 
recycling. We comment on GMRES-based recycling in Sect. 6.4 and have further 
investigated this in [9]. CGS- and BiCGStab-recycling is suggested in [1], but the 
investigation of these new variants for our purposes is beyond the scope of this paper. 

However, the number of the recycled descent vectors may become too large if 
there are many systems in the sequence. If the size of A is very large, then storing 
and computing all the descent vectors requires very large memory and also slows 
down the speed of computation due to orthogonalization needs. Therefore, we 
propose a restarting technique which limits the number of recycled descent vectors 
so that the required workspace can be fixed a priori. 

Conventional block iterative methods (see [12, 18] and references therein) 
cannot be applied to (6.1) if the bj are supplied successively (like in the MOR 
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applications to be discussed), since these methods require that the right-hand sides 
are available simultaneously. A new method for solving systems A,.*:, = /?, is 
proposed in [16], where the focus is on varying A,-. This approach turns out to be 
quite involved for systems with the same coefficient matrix A as in (6.1). The 
proposed algorithms are simpler and easier to implement. 

In the next section, we propose various recycling algorithms based on standard 
GCR. We discuss the convergence properties of these algorithms. In Sect. 6.3, we 
review the MOR method from [2] for circuits with massive inputs and outputs. We 
show where sequences of linear systems as in (6.1) arise in this context and how 
the recycling algorithms are applied. In Sect. 6.4, we show simulation results of 
the proposed algorithms and the standard GCR method when applied to MOR of a 
circuit with numerous inputs and outputs. The recycling algorithms are compared 
with GCR by the number of matrix-vector products needed for solving each 
system. We also compare recycled GCR with GMRES-based recycling as sug- 
gested in [20]. Conclusions are given in the end. 



6.2 IMethods Based on Recycling Krylov Subspaces 

In this section, we first introduce standard GCR and discuss some of its properties. 
Based on this method, we propose several recycling algorithms for solving 
sequences of linear systems as in (6.1). In addition, we present a convergence 
analysis of one of the recycling algorithms in exact arithmetic. (A finite precision 
analysis is beyond the scope of this paper.) 



6.2.1 The Standard Generalized Conjugate Residual Method 

The standard GCR method is proposed to solve a single system of linear equations 
Ax — b in [5]. GCR belongs to a kind of descent methods for nonsymmetric 
systems, which have the general form as below: 

Algorithm 1 Descent method 

1 . given xi (initial guess), compute r^ — b — Axi ; 

2. sei p\ — ri\ k — \\ 

3. while ||r^,|| > toldo 

4. a, - (''^'"-' ■ 



(AptAptY 

5. Xk+i =Xk + atPk, 

6. rk+\ = rk- dkApk; 

7. compute /7^+i; 

8. k^k+l; 

9. end while 
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The parameter di^ is chosen to minimize \\rk^\ jjj as a function of a, so that the 
residual is reduced step by step. The descent methods differ in computing the new 
descent vector Pk+\- For GCR, the descent vector p^+i is computed so that it 
satisfies (A/j^+i , Apj) = for 1 <_/ < k (see [5] for details; (y, z) — z^y denotes the 
Euclidian inner product in M,,). For completeness, we present a version of standard 
GCR in the following algorithm. 

Algorithm 2 GCR 

impute r\ — b ~ Ax\ ; set ^ = 1 ; 



1. 


given xi (mitial guess 


2. 


while \\rk\\2 > toldo 


3. 


Pk = rk;qk =Apk; 


4. 


foTJ = 1 to A: — 1 


5. 


Pkj = (?*,?/); 


6. 


qk = qk~ Pkjqj 


7. 


Pk=Pk- kjPj 


8. 


end for 


9. 


qk ^qklW'qkWi.Pk ■ 


10. 


'^k = {.rk,qk); 


11. 


Xk+\ =Xk + XkPkl 


12. 


n+i ^n- ^kqk] 


13. 


k^k+l; 


14. 


end while 



-Pk/imh'^ 



It is not difficult to prove that the descent vectors pk in Algorithm 2 satisfy 
{Apk+i,Apj) ~ fory = 1, . . .,A: — this is the effect of the f or-loop which serves 
the purpose of A^A-conjugation of the descent vectors. It is also proven in [5] that 
Xk+i minimizes R{w) — \\b — Awjljover the affine space xi + span{/7i,/?2, • . •,/?*}, 
where span{pi, p2, .. .,pk} constitutes a Krylov subspace, i.e. span{pi,p2, ■ ■ ., 

Pk} = span{pi,A/7i,...,A'^"Vi}- 

In the next subsection, we will show how to employ the descent vectors from 
the previous systems for solving the current system. This way, the standard GCR 
method can be accelerated and many matrix-vector products can be saved when 
solving all linear systems in a sequence as in (6.1). 



6.2.2 Recycling Methods 

In this subsection, we propose three variants of recycling algorithms based on the 
standard GCR method. The idea of recycling descent vectors in GCR is motivated 
by the recycling algorithm in [19], where sequences of linear systems 
(/ + (T,A) = bj, i = 1,2, . . .,m, are solved. Unfortunately, the algorithm in [19] is 
not convergent for some cases, because the computed Xk does not minimize 
R{w) = \\b — Aiw\\2 (A; = I + (jjA) at each iteration step. In the following, we 
present a convergent recycling algorithm: Algorithm 3. We will further show that 
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for each system Ax — bj, x\ generated by Algorithm 3 minimizes R{w) = 
\\bi — Awjlj at step k. 

Algorithm 3 GCR with recycling for Ax = bj,i = 1,2,.. .m. 

1. NumDirecs — 0; (number of descent vectors pj, j ~ 1,2,...) 

2. fori = 1 to m 

3. given x\ (initial guess for the /th linear system), compute r\ = bi — Ax\; 

4. set k — \; 

5. while 114 II2 > tol do 

6. if ^ > NumDirecs then 

7. Pk = r[; 

8. qk^Apk; 

9. forj = 1 tofe- 1 

11- kk^qk-kjqh 

12. Pk^pk- kjPh 

13. end for 

14. qk = qk/\\qk\\2^Pk^Pkl\\qk\\2\ 

15. NumDirecs — k; 

16. end if 

17- 4 = (r^qk); 

18. x[^^ =x[ + a[pk\ 

19- 4+1 = 4 - 4?*; 

20. k = k+\; 

21. end while 

22. end for 

In Algorithm 3, every system Ax = i>,-, / = 2, 3, . . ., reuses the descent vectors 
Pj computed from the previous systems. For example, in the beginning of 
solving system Ax — b/^, NumDirecs denotes the number of descent vectors 
computed when solving Ax = bi,i ~ 1,2, . . .,k — 1. These descent vectors are 
reused when computing the approximate solution of Ax = b^. If these descent 
vectors cannot produce an accurate enough solution, then new descent vectors 
are computed, followed by the matrix-vector multiplication q^ = Apt in the if 
loop. This way, a considerable number of matrix-vector products together with 
the corresponding conjugations are saved when solving the current system. 
Altogether the amount of computations can be significantly reduced as com- 
pared to starting an iterative solver from scratch for each new system in the 
sequence. Furthermore, Algorithm 3 has similar convergence properties as 
standard GCR as is shown in the following. We first state the results and then 
comment on the proofs. 

Corollary 1 The descent vectors generated by Algorithm 3 are A A-orthogonal, 
that is, {Api,Apj) ~ 0, i ^ j. 
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Corollary 2 For each linear system Ax — bj in (6.1), x^+i generated by Algo- 
rithm 3 minimizes R{x) = \\bi — Axjlj over the subspace x\ + spanjpi ,/72, • • -jP/t}- 

Following the corresponding proof of Theorem 3. 1 in [5], the proof of Corollary 
1 is straightforward. (Essentially, the proof consists of observing that the f or-loop 
in Algorithm 3 is modified Gram-Schmidt A^A-orthogonalization.) Because x^+i in 
Algorithm 3 is constructed analogously to .t^+i in Algorithm 2, and using the proof 
of Theorem 3.1 in [5], Corollary 2 can also be easily proven if the descent vectors 
Pj, j — 1,2, .. .k are A^A-orthogonal, which is guaranteed by Corollary 1. Due to 
space limitation, a complete proof is not provided here. 

In Algorithm 3, the search space is actually composed of several Krylov sub- 
spaces. For example, when solving the second system, the recycled vectors con- 
stitute the Krylov subspace 

^„,(A,r!) = {r!,Ar;,...,A«'-V;}. 

Furthermore, if new descent vectors are computed to obtain the solution of the 
second system, they constitute a second Krylov subspace 

K„,iA,rl^,) = {rl^„Arl^„...,A"^-'rl^,}. 

Here, n,-, i = 1,2 are the number of vectors in the Krylov subspaces. That is, the 
approximation x of Ax — b2 minimizes the residual over 

x^^+K,„{A,'-\)+K,„iA,rl+i), 

which, if analyzed further, is a nested Kjylov subspace since r^ ^j is generated 
using A'„,(A, rj), for details see [4]. 

Generally, for the ith system, the recycled vectors are all the linearly inde- 
pendent vectors from the previous Krylov spaces K„^{A,r\),K„^{A,r^ _^^^), 
Ki,,{A,rl ^^^^^), . . .. Thus, in general, the number of recycled Krylov vectors 
increases with the number of linear systems to be solved. We will see in Sect. 6.4 
that this causes a problem as often, for each new system, some new vectors have to 
be added to attain the required accuracy. Thus, the major drawback of Algorithm 3 
is: if there are a large number of systems in (6.1), the number of recycled descent 
vectors generated may become simply too large. Since all the descent vectors are 
reused for the next system, they all have to be stored till the final system is solved. 
For very large-scale systems, the required memory resources will be massive. 
Moreover, the conjugation part of the algorithm will slow down the algorithm. 
Therefore, we propose Algorithm 4 in which a restarting technique is used. This 
keeps the number of recycled descent vectors within a maximum limit which can 
be based upon the available memory and the dimension of the linear systems. 

Algorithm 4 Algorithm 3 with restarts. 

1. NumDirecs = 0; (number of descent vectors pj, j — 1,2,...) 

2. for/ = 1 tow 
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3. if NumDirecs > MaxDirecs 

4. NumDirecs — 0; 

5. end if 

6. perform steps 3.-21. of Algorithm 3; 

7. end for 

Algorithm 4 is equivalent to Algorithm 3, except for the first if -branch in the 
beginning of the algorithm. In the algorithm, MaxDirecs is the maximum number 
of recycled descent vectors, and the recycling is restarted whenever MaxDirecs is 
reached. When recycling is restarted, the algorithm is identical with standard GCR. 
This restart technique is somewhat heuristic, because the optimal MaxDirecs is 
difficult to determine. Here "optimal" is meant in the sense that the overall number 
of matrix-vector products is minimized. However, from experimental experience, 
the algorithm requires the least matrix-vector products if MaxDirecs is set to be a 
little larger than the actual number of iteration steps needed by standard GCR for 
solving a single system. 

Since Algorithm 4 reduces to the standard GCR method whenever it restarts, no 
vector is recycled for the current system. As information is wasted this way, this 
can be considered as a drawback of the algorithm. In the following Algorithm 5, 
we propose another possibility of controlling the number of recycled vectors in 
Algorithm 3. Defining M to be the number of recycled vectors used in each (except 
for the first) system, we then use the M descent vectors which are computed when 
solving the first system as the recycled vectors for each of the following systems. 
Thus, each system is guaranteed to be able to recycle certain vectors. The second 
advantage of Algorithm 5 over Algorithm 4 is that M can be set much smaller than 
MaxDirecs; therefore it can outperform Algorithm 4 by saving computations in the 
orthogonalization process (Step 10-14 in Algorithm 3). 

Algorithm 5 Algorithm 3 with fixed number of recycled vectors. 

1. for/ = I torn 

2. if / = 1 

3. NumDirecs — 0; 

4. else 

5. NumDirecs — M; (number of fixed recycled vectors) 

6. end if 

7. given x\ (initial guess for the /th linear system), compute r\ — bi — Ax\ ; 

8. Jfc=l; 

9. while 114112 > tol do 

10. if ^ > NumDirecs 

11. perform steps 7.-14. of Algorithm 3; 

12. end if 

13. perform steps 17.-20. of Algorithm 3; 

14. end while 

15. end for 
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Algorithm 4 is the restarted version of Algorithm 3, therefore the recycled 
vectors also constitute a union of several Krylov subspaces. Once the algorithm 
restarts, new recycled Krylov subspaces will be generated. Algorithm 5 always 
recycles the same search space, which is generated when solving the first linear 
system(s) (in case M or more vectors are needed for solving Ax — by, these are 
taken from /ir„|(A,r}), otherwise some Krylov vectors from the next Krylov 
subspaces cire also included). 

It is known that GCR may break down in some cases. However, if the coeffi- 
cient matrix is positive real, i.e., A + A^ is positive definite, then it can be shown 
that no breakdown occurs [5]. In the following section, we will discuss this issue 
for those linear systems of equations that we are interested in, that is, system 
matrices A arising in the context of model reduction algorithms for circuit 
simulation. 



6.3 Application to Model Order Reduction 

In this section, we review the model order reduction (MOR) problem for linear 
systems. Based on this, the method SPRIMA [2] for MOR of circuits with massive 
inputs and outputs is briefly described. We show that in SPRIMA, successive 
systems of linear equations as in (6.1) need to be solved. 

MOR [7, 15] has been widely applied to the fast numerical simulation of large- 
scale systems in circuit design. Using MOR, the original large dimensional system 
is approximated by a system of (much) smaller dimension. If the approximation 
error is within a given tolerance, only the small system needs to be simulated, 
which will in general take much less time and computer memory than the original 
large system would do. Here, we consider linear, time-invariant descriptor systems 
of the form 

Cx{t) + Gx{t) = Bu{t), y{t) = L^x{t) (6.2) 

with system matrices C, G G K"''",5 G R"''^ and L e M"'"'. Here, x{t) e R" 
denotes the unknown descriptor vector of the system, while u{t) and y{t) stand for i 
inputs and r outputs, n is the order of the system. Conventional moment-matching 
MOR methods, as, e.g., PRIMA [15] derive the reduced-order model by com- 
puting a projection matrix V having orthonormal columns and satisfying 

range{y} = span{(soC + Cy^B, -{sqC + G)"'C(soC + Cy^B, 
...,{~(soC + Gy'cY{sQC + Gy'B}, p^n. 

The reduced-order model is obtained by using the approximation x x Vz, leading 
to the projected system 

G(r) + Gz{t) = Bu{t), y{t) = L^z{t), (6.4) 
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where C = V'^CV, G = V^GV e R«^«,5 = V^B e W'',L = V'^L e W'. The 
matrix V S R"^^ with q <^ n depending on p and i satisfies V^V — I. The new 
descriptor vector is z{t) G R^. 

If there are many inputs and outputs in (6.2), conventional MOR methods as the 
one described above are inefficient in obtaining a reduced-order model. It is not 
difficult to see from (6.3) that each term in the Krylov subspace is given as a 
matrix with / columns, where I is the number of inputs as well as the number of 
columns in B. If the number of linearly independent columns in B is very large, 
then the number q of columns of the projection matrix V increases very quickly. 
This may cause that a system fails to be reduced, the dimension of the reduced- 
order model can become as large as that of the original system. 

Several methods have been proposed for MOR of systems with massive inputs/ 
outputs, including the ones discussed in [2, 8, 13, 14]. One of the newly proposed 
methods is SPRIMA [2]. When there is a large number of inputs, SPRIMA reduces 
the original system (6.2) by decoupling it into / single-input systems. The basic 
idea is described in the following. Based on the superposition principle for linear 
systems, the output response of the multi-input multi-output system (6.2) can be 
obtained by the sum of the output responses of the following single-input systems: 

fori^ l,2,...,f, (6.5) 

yiit)^L'xit), 

i.e., y{t) ~ yi (t) + y2{t) + ■ ■ ■ + y('{t), with bj being the ith column of B. 

Since PRIMA [15] is efficient for MOR of single-input systems, by applying 
PRIMA to each single-input system in (6.5), we can get approximate output 
responses y\,y2, ■■•,>'/ of yi,y2, ■ ■ ^ye from the reduced-order models of the sin- 
gle-input systems 



Cz{t) + Gz{t) = biu{t), 



^ ^^ for/ ^1,2,...,! (6.6) 



The output response of system (6.2) is thus approximated by 3'(/) « vi (f) + 

y2{t) + ---+yi{t). 

Based on the above considerations, we can see that by decoupling the original 
system with numerous inputs into systems with a single input, SPRIMA [2] 
overcomes the difficulties of PRIMA. As can be seen from (6.6), we actually create 
several reduced-order models, but the final output of the original system can be 
computed by the superposition of all the reduced-order models. Moreover, the final 
transfer function of the original system can also be represented by the transfer 
functions of the reduced models [2]. 

As has been introduced in (6.3), for the /th single-input system in (6.6), PRIMA 
requires the matrices V, to be computed as 

rangejy,} = span^^oC + G)"'/?,-, -{sqC + G)"'C(ioC + G)"'/?,-, 

. . ., (-(^oC + G)-'cY-{sqC + G)-'bi}. (6.7) 
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It is clear from (6.7) that in order to compute Vi, i — I, . . .,£, we have to solve 
X]i=iPi + ^ systems of linear equations with identical coefficient matrix sqC + G, 
but different right-hand sides. In [2], it is proposed to use the (sparse) LU 
decomposition sqC + G to solve systems of medium dimension. For very large- 
scale systems, standard iterative solvers are employed to solve them one by one. 
For circuit systems with numerous inputs/outputs, thousands of such systems are 
produced during the MOR process, the computational complexity in solving them 
increases much too fast. Therefore, efficient computational algorithms for solving 
successive systems of linear equations are very important to improve the whole 
MOR process. Thus, we propose to use the recycling algorithms proposed in 
Sect. 6.2 to solve the sequence of systems arising from (6.7). 

Please note that for each single-input system in (6.5), MOR methods other than 
PRIMA can also be employed to derive the reduced single-input systems in (6.6). 
For example, one-sided rational Krylov methods [11] (which allow multiple 
expansion points) are possible here. 

It remains to discuss the breakdown issue in GCR for the linear systems of 
equations encountered here. If RLC circuits are modeled by modified nodal 
analysis (MNA), the matrices in (6.5) have the structure 



C = 



where Gi,Ci are symmetric positive semidefinite and C2 is symmetric positive 
definite [10]. As a consequence, both G, C and thus A — sqC + G have a positive 
semidefinite symmetric part for any real ^o- (Similar conclusions can be drawn for 
complex sq using complex conjugate transposes.) Unfortunately, the no-break- 
down condition in [5] requires A to be positive real, i.e., its symmetric part to be 
positive definite. But from the structure, we can only infer that the symmetric part 
of A is positive semidefinite. Although we have not encountered breakdown in any 
of the numerical examples performed for RLC circuit equations, the possibility of 
breakdown cannot be excluded. As we are fairly close to the necessary positive 
realness condition for A, there is hope that breakdown will only be encountered in 
pathological cases. Moreover, in many situations (if the circuit neither contains 
voltage sources nor a cutset consisting of inductances and/or current sources 
only, see [6]), it can be shown that sqCi + Gi is positive definite; in these cases, 
A — sq C + G will be positive real and thus, breakdown cannot occur. 



6.4 Simulation Results 

In this section, we illustrate the efficiency of the proposed recycling algorithms 
when used within SPRIMA. As a test case, we employ an RLC circuit with 
i — 854 inputs and r = 854 outputs. The dimension of the system is fairly small. 
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Fig. 6.1 Algorithm 3: J of matrix-vector products, a All systems and b all but 1st system 

n = 1,726, but it allows to demonstrate the effects of recycling. The model was 
generated with the circuit simulator TITAN. ' 

We compare Algorithms 3-5 with the standard GCR method (Algorithm 2). If 
we set pi = 19 in (6.7), then m = (pi + 1)£ = 20 x 854 = 17,080 systems of 
linear equations have to be solved in order to obtain the projection matrices 
Vi, i = 1,2, ...,£. In this case, p, — 19 is necessary to obtain the desired accuracy 
of the reduced-order model. As for a preconditioner, we employ an incomplete LU 
factorization with threshold 0.01 (in MATLAB® notation: ilu(A, 0.01)) within 
all algorithms. 

We first compare standard GCR with Algorithm 3 by solving the first 3,000 
systems. (Similar results for all the systems are also available.) The number of 
matrix-vector (MV) products for solving each system is plotted in Fig. 6.1. The 
number of MV products required by Algorithm 3 is much less than that of GCR. 
Only very few MV products are required after solving the first several systems. 
Starting from the 1,000 th system, there are almost no MV products necessary. 

Figure 6.1 suggests that Algorithm 3 is much more efficient than GCR. How- 
ever, this would only be true if n would be significantly larger than the number of 
systems to be solved (which is not the case in the example considered here). 
Figure 6.2 illustrates that in this example, the number of recycled vectors reaches 
n before all systems are solved so that storing all recycled vectors need a full 
n X n array which is not feasible for larger problems. In order to solve this 
problem, we propose to use Algorithm 4 or Algorithm 5. 

Figures 6.3 and 6.4a compare the MV products required by GCR with the 
corresponding numbers resulting from Algorithm 4 and Algorithm 5, respectively. 
Figure 6.3 shows that although Algorithm 4 needs more MV products than 
Algorithm 3, it is applicable to much larger systems as the number of recycled 
vectors remains limited. Figure 6.4a illustrates that Algorithm 5 can save MV 



TITAN was then developed by Qimonda AG, Neubiberg (Germany); after insolvency of 
Qimonda in 2009, TITAN is now owned by Infineon Technologies AG, Neubiberg. 
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products for each system, whereas Algorithm 4 cannot save MV products for all 
the systems as is shown in Fig. 6.3, because Algorithm 4 falls back on GCR 
whenever it restarts. One can also conclude from Fig. 6.4b that the number of 
iteration steps used in Algorithm 5 is almost the same as that of GCR for solving 
each system. Notice that Algorithm 5 uses M = 15 recycled vectors for each 
system; therefore, it actually saves M = 15 MV products for each system in the 
standard form as 1 MV product is computed per iteration. However, if incomplete 
LU preconditioning is used, there will be three MV products at each iteration step 
in GCR: one is due to the coefficient matrix itself, the other two are due to the 
backward/forward solves employed by the preconditioner. Therefore, M = 15 
iteration steps actually lead to 45 MV products. 

In Table 6.1, we list the MV products saved by Algorithms 4 and 5 for solving 
several groups of systems of linear equations. It is clear that the recycling algo- 
rithms can save many MV products when solving long sequences of systems, and 
therefore can improve the overall efficiency of MOR. 

In order to show the accuracy of the reduced-order model obtained by applying 
the recycling algorithms, we compare the simulation results of the reduced-order 
model with those of the original model in Fig. 6.5a, b. We use step functions at all 
the terminals as the input signals and observe the output response at the first 
terminal. The reduced model is derived by using Algorithm 5. The output 
responses of the reduced model and the original system at the first terminal are 
plotted in Fig. 6.5a and the maximum relative error between them is 4.8 x 10~ . 



Table 6.1 Number of MV products saved by Algorithms 4 and 5 
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Fig. 6.5 Accuracy of the reduced model, a Comparison of the output response and b comparison 
of the transfer function 
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We have similar accuracy for the output signals at the other terminals. 
The magnitudes of the transfer functions for both systems are shown in Fig. 6.5b. 
The transfer function shown here is the mapping from the input signal at the first 
terminal to the output response at the same terminal. The relative error (computed 
point- wise at 201 frequency samples) of the magnitude of the reduced transfer 
function is around 10~' , therefore the two waveforms are indistinguishable in the 
figure. The accuracy of the transfer functions at any other two terminals is also 
acceptable, with errors usually smaller than 10~^. 

A new recycling method based on GMRES is proposed in [20]. The basic idea 
of recycling in this method, called MKR_GMRES, is quite similar to Algorithm 3. 
The main difference is that MKR_GMRES uses GMRES as the basic algorithm for 
recycling, the recycled vectors are the orthogonal basis vectors generate by the 
underlying Arnoldi process. When solving the /th system, the Amoldi vectors 
generated by all the previous linear systems are reused to generate a "better" 
initial solution. If the initial solution is not accurate, new Arnoldi vectors are 
computed and the solution is generated in the augmented Krylov subspace spanned 
by all the Arnoldi vectors. Because all the orthogonal Arnoldi vectors are recycled, 
therefore after solving a certain number of systems, the number of new Arnoldi 
vectors becomes less and less. This can be observed from the number of MV 
products used for solving each linear system (illustrated in Fig. 6.6a). It has a 
similar behavior as Algorithm 3 which recycles all the previously computed 
descent vectors. The method also has a similar problem as Algorithm 3, that is the 
dimension of the recycled Krylov subspace increase fast, see Fig. 6.6b. Since each 
new Arnoldi vector has to be orthogonalized to all the previous ones, the 
orthogonalization process becomes slower and slower, which affects the efficiency 
of the whole algorithm. Although Algorithms 4 and 5 are somewhat heuristic, 
they can remedy the the weakness of Algorithm 3, and also work quite well for 
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certain systems. It is also possible to construct recycling method based on nested 
GMRES. Here, "nested" means the GMRES method is nested inside some other 
iteration solver and contributes to the inner iterations. We have studied this 
recycling technique in [9]. In this paper we have applied the algorithm to para- 
metric model order reduction in system level simulation for Microelectrome- 
chanical systems (MEMS). The outer iteration of the recycling method is GCRO 
proposed in [4], and the inner iterations are done by GMRES. The recycled sub- 
space is generated by Harmonic Ritz vectors of the coefficent matrix A and does 
not inflate with the number of linear systems as happens in the algorithms here. 
The original version of the recycling algorithm in [9] is proposed in [16], where 
the theoretical background of the method is analyzed in detail. 



6.5 Conclusions 

We have proposed variants of recycling descent vectors in GCR for solving 
sequences of linear systems with varying right-hand sides. The recycling methods 
are efficient for solving sequences of large-scale linear systems successively. The 
methods discussed here are limited to solving systems with identical coefficient 
matrix and different right-hand sides. The techniques proposed for controlling the 
number of recycled vectors are heuristic. Further research will focus on more 
robust recycling techniques (such as thick restarting) and on solving systems with 
different coefficient matrices. 
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Chapter 7 

Data-Driven Parameterized Model Order 
Reduction Using z-Domain Multivariate 
Orthonormal Vector Fitting Technique 

Francesco Ferranti, Dirk Deschrijver, Luc Knockaert and Tom Dhaene 



Abstract Efficient real-time design space exploration, design optimization and 
sensitivity analysis call for Parameterized Model Order Reduction (PMOR) 
techniques to take into account several design parameters, such as geometrical 
layout or substrate characteristics, in addition to time or frequency. This chapter 
presents a robust multivariate extension of the z-domain Orthonormal Vector 
Fitting technique. The new method provides accurate and compact rational para- 
metric macromodels based on numerical electromagnetic simulations or mea- 
surements in either frequency-domain or time-domain. The technique can be seen 
as a data-driven PMOR method. 

Keywords Parametric Macromodels • Vector Fitting • Approximation Algorithm • 
Rational Functions • Least-Squares 



7.1 Introduction 

Nowadays, full-wave electromagnetic methods [9, 11, 17] are widely used to sim- 
ulate a variety of complex electromagnetic systems and are considered to be essential 
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for efficient design. The use of these methods usually results in the computation of a 
huge number of field (E,H) or circuit (i,v) unknowns, in the frequency-domain or 
time-domain, although users are usually only interested in a few of them at the input 
and output ports. These methods provide high accuracy, often at a significant cost in 
terms of memory storage and computing time. Therefore, Model Order Reduction 
(MOR) techniques are crucial to reduce the complexity of the model defined by the 
full-wave numerical method and the computational cost required by simulations, 
while retaining the important physical features of the original system [3,7]. 

Efficient real-time design space exploration, design optimization and sensitivity 
analysis require the development of accurate parametric broadband macromodels 
that approximate the dynamic behavior of a system characterized by several design 
parameters, such as geometrical layout or substrate characteristics, in addition to 
time or frequency. These applications call for Parameterized Model Order 
Reduction (PMOR) techniques. 

A frequency-domain technique called Multivariate Orthonormal Vector Fitting 
(MOVF) was presented in [4], to compute accurate rational parametric macromodels, 
based on parameterized frequency responses with a highly dynamic behavior. This 
technique can be seen as a data-driven PMOR method. Instead of reducing the size of 
the matrices of a parameterized state-space model directly {model-based PMOR), 
MOVF builds rational parametric macromodels with a reduced model complexity 
based on a set of input-output data samples. The goal of the macromodeling algorithm 
is to find a multivariate rational function which approximates a large set of A" + 1 data 

samples { {s, g)^, , H{s, g)^^ } , _„ in a least-squares sense. These data samples depend on 

the complex frequency s — j(o and several additional parameters g — (g'"^)„^[ as 
design variables which describe e.g. the metallizations in an EM circuit (lengths, 
widths, . . .) or the substrate features (thickness, dielectric constant, losses, . . .). The 
proposed approach results in accurate and compact rational parametric macromodels 
of complex electromagnetic systems. A generalization of MOVF to include parameter 
derivatives in the modeling process was proposed in [6] . Parameter derivatives provide 
additional information about the underlying system and can often be simulated at a 
significantly lower computational cost than additional samples [5, 13, 15, 20]. The 
inclusion of derivatives can be useful to reduce the required amount of data samples, 
while preserving the accuracy of the results. In this chapter a new technique, the 
z-domain Multivariate Orthonormal Vector Fitting (ZD-MOVF) is described, repre- 
senting the z-domain counterpart of [4]. It is a robust multivariate extension of the z- 
domain Orthonormal Vector Fitting technique (ZD-OVF) proposed in [14, 16, 21]. A 
microstrip example confirms the ability of the new algorithm to build parametric 
macromodels of dynamic systems with a good accuracy. 

7.2 Background 

In this section we explain the generation of z-domain data starting from frequency- 
domain or time-domain data and the choice of the 1 parameter of the Tustin transform. 
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7.2.1 Generation of z-Domain Data 

Microwave circuits and components can be characterized in frequency-domain or 
time-domain by numerical electromagnetic simulations or measurements. To 
obtain the corresponding parameterized z-domain response, HJ^z, g), where z is 
the complex discrete frequency variable and g is a real design variable, a Tustin 
(bilinear) transform: 

s-^z^i^, AeM+ (7.1) 

A — s 

can be used starting from frequency-domain data H^.(s, g): 

H,is,g)^H,(^z = ^j^^,g'j (7.2) 

where c stands for continuous and d for discrete. If time-domain data is available, 
under the hypothesis of a negligible or absent aliasing in the sampling process, the 
frequency response of a continuous-time system can be computed by applying 
standard techniques, such as Fast Fourier Transform (FFT) algorithms on the data 
samples: 

hci{[ft],g) = hc{nTs,g) (7.3) 

where the real sequence h^dn], g) is equal to the signal in the time domain 
/Zf (f, g) at the equally spaced time samples nT^ and T^ is the sampling period. Once 
the parameterized frequency response is computed, the Tustin transform (7.1) is 
used as before. The obtained z-domain data can be normalized by discrete fre- 
quency z [14]. Once the parametric macromodel is computed in the z-domain, it 
can be converted back to the s-domain by using inverse Tustin transform. 

7.2.2 Choice of k of the Tustin Transform 

The i parameter of the Tustin transform can be freely chosen [19] under the 
constraint that it is not a real pole of the continuous-time system [1]. The 
numerical example in this letter shows that the algorithm is robust with respect to 
an arbitrary choice of i, since its value does not affect the accuracy of the results 
over a wide range. To avoid harmful numerical conditions, extreme values of 1 
have to be discarded, such as very low (near zero) or very high (near infinity). 



7.3 Parametric IMacromodeling 

To simplify the notation, the algorithm is only described for bivariate systems. The 
extension to the full multivariate formulation is straightforward. As in [4], 
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the ZD-MOVF algorithm proposes to represent the parametric macromodel as the 
ratio of a bivariate numerator and denominator 

where P and V represent the maximum order of the corresponding basis functions 
Xp{z) and \l/y{g) in the complex discrete frequency variable z and the real design 
variable g, respectively. To establish the coefficients Cp^ and c,„, of numerator and 
denominator in (7.4), the ZD-MOVF algorithm minimizes the Sanathanan- 
Koerner (SK) cost function [18] on a set of A" + 1 data samples {(z, g)^, 

Hd{z,g)i^}ii^Q- SK is an iterative procedure, in the first iteration step of the algo- 
rithm (t — 0) Levi's cost function [12] is minimized to obtain an initial estimate of 
the coefficients Cpy and Cpv In the following steps (t — 1, . . .,T) of the SK iteration, 
the inverse of the previously estimated denominator D'^'~^\z,g) is used as an 
explicit least-squares weighting factor. A relaxed non-triviality constraint is added 
as an additional row in the system matrix [8], to avoid the trivial null solution and 
improve the convergence of the algorithm. Each equation is split in its real and 

imaginary parts, to ensure that the model coefficients Cpl, Cpi are real. Scaling each 
column to unity length [7] is suitable to improve the numerical accuracy of the 
results. 



7.4 Choice of Basis Functions 

In this section we describe the choice of the basis functions for the discrete 
frequency and other parameters. 



7.4.1 Discrete Frequency-Dependent Basis Functions 

Based on a prescribed set of stable poles a — {— flp} [, a set of partial fractions 
Xp{Zi a) is chosen, with Xoiz) = 1. To select the poles two steps are followed: first, 
they are chosen in the s-domain as complex conjugate pairs with small real parts 
and the imaginary parts linearly spaced over the frequency range of interest [7] and 
after that, the Tustin transform (7.1) is applied to map them from s- to z-domain. 
A linear combination of two fractions is chosen to ensure that the residues of 
~lp{z, a) and Xpj_i (z, a) come in perfect conjugate pairs leading to real-valued time 
domain responses, i.e.: 

Xp{z, a) == z{z + apY^ + z{z + a,,+i)"' (7.5) 



7 Data-Driven Parameterized Model Order Reduction 145 

Xp+i{z,a) =jz{z + apy^ -jz{z + ap+iy^ (7.6) 

To improve the numerical stability of the modeling algorithm, the Takenaka- 
Malmquist basis functions [10] can be used, as shown in [16]: 

.,fca).Jlll±<;) /'-f-°^l (7.7) 

\%i z + ai J (z + ap)(z + flp+i ) 

r ^ , /^Tt1+<z\ (l+z)|l + flJ ,^ „, 

\^J[ z + ai J[z + ap)[z + ap+x) 
where 

kp = Vi = ^\/l - l«pl' (7.9) 

The orthonormal basis functions can improve the conditioning of the system 
equations and are less sensitive to the choice of the initial poles. Their use ensures 
a more numerically robust macromodeling procedure [3]. 



7.4.2 Parameter-Dependent Basis Functions 

The parameter-dependent basis functions \l/^,{g,\i) are also chosen in partial 
fraction form as a function of 7,g, hence in rational form. The set of starting poles 
b = {— /7v}y=i is composed by complex pairs with small real parts of opposite sign 
and imaginary parts linearly spaced over the parameter range of interest, provided 
that \jJo{g) = 1. A linear combination of two fractions is used to ensure that 
\l/^:{g,\i) and i/'^^i(g,b) are real functions [4]: 

'/'.(g,b) = ijg + b,)-' - {jg~{K)*)-' (7.10) 

•Av+ifeb) ^jijg + h)-' +j(jg-{b,)*)-' (7.11) 



7.5 Example: Double Folded Stub Microstrip Bandstop Filter 

The double folded stub microstrip bandstop filter [2] under study is shown in 
Fig. 7.1. The substrate is 0.1270 mm thick with a relative dielectric constant e^ 
equal to 9.9. The scattering parameters of the system are simulated by ADS- 
Momentum [1]' in the s-domain and subjected to (7.2), to obtain the corresponding 
parametrized z-domain response. 



Momentum EEsof EDA, Agilent Technologies, Santa Rosa, CA. 
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Fig. 7.1 Geometry of the 
double folded stub microstrip 
bandstop filter [2] 




Fig. 7.2 Magnitude of the 
trivariate macromodels of 5i 
(light grey surface) and 52 1 
(dark grey surface) for 
5 = 0.061 mm 
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Fig. 7.3 Magnitude of the 
trivariate macromodels of 5i 
(light grey surface) and ^21 
(dark grey surface) for 
S = 0.243 mm 




Length [m 



10 
Frequency [GHz] 



The parametric macromodels of scattering parameters ^n and ^21 are built as 
functions of the varying length of each folded segment L G [1.98— 2.40 mm] and 
varying spacing between a folded stub and the main line S G [0.061-0.243 mm] 
over the frequency range (5-20 GHz). The desired model accuracy is set to —60 
dB, which corresponds to three significant digits. The initial data grid for ^n and 
521 is of size 14 x 10 x 22 samples (L, S,freq). The corresponding number of 
poles is chosen 6, 4 and 10 for both scattering parameters. Figures 7.2 and 7.3 
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Fig. 7.4 Histogram: error 
distributions of the trivariate 
macromodels of Si i (light 
grey) and 52 1 (dark grey) over 
226,500 validation samples 
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show the magnitude of the trivariate macromodels of ^n and ^21 for the minimum 
and maximum values of the spacing variable S. 

To compute the macromodels only 4 and 3 iterations of SK method discussed in 
Sect. 7.3 are needed and the maximum absolute error in the initial data grid 
corresponds to —62.84 and —67.54 dB, respectively. To confirm the quality of 
built macromodels a set of validation data samples is computed on a very dense 
grid of size 50 x 30 x 151 samples. The histogram in Fig. 7.4 shows the number 
of validation samples that corresponds to a certain absolute error for both trivariate 
macromodels. Figure 7.4 shows that they have a good overall accuracy and the 
maximum absolute error over all the validation samples is bounded by —60.17 and 
—61.06 dB for Sn and ^21 respectively. The choice of the 1 parameter in the 
Tustin transform (7.1) does not influence the model accuracy over a broad range of 
values [10~^— 10^^]. It confirms that 1 is free to choose and illustrates the 
robustness of the algorithm. 



7.6 Conclusions 

This chapter presents a robust multivariate extension of the z-domain Vector 
Fitting technique [14, 16, 21], for the calculation of accurate and compact para- 
metric macromodels of high-speed components. By combining rational basis 
functions and the Sanathanan-Koerner least-squares estimator, the robustness of 
the method is ensured. An example illustrates the capability of the algorithm to 
model dynamic parameterized frequency responses with a good accuracy. Once 
the multivariate macromodeling process is completed, the resulting scalable 
behavior model can efficiently be employed in real-time design space exploration, 
fast optimization and sensitivity analysis. 
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Chapter 8 

Network Reduction by Inductance 

Elimination 



Mark M. Gourary, Sergey G. Rusakov, Sergey L. Ulyanov 
and Michael M. Zharov 



Abstract A new approach to model order reduction of passive linear circuits is 
proposed. It is based on the successive elimination of small inductances. Simul- 
taneously with the inductance removal additional capacitances are connected to 
neighboring branches to provide first order accuracy of the reduced transfer 
functions. The passivity of the reduced circuit is easily verified. The proposed 
approach can be applied jointly with the widely used node elimination algorithm 
TICER. 



8.1 Introduction 

Model-order reduction (MOR) of linear interconnects is the effective tool to 
decrease computational efforts in the simulation of integrated circuits (IC) [1]. 

Fast MOR of RC interconnect can be achieved by the TICER method [2]. The 
TICER performs the reduction by the successive elimination of circuit nodes. An 
analysis of the TICER in comparison with other approaches can be found in [3]. 
TICER usually does not provide the circuit reduction ratio that is achieved in the 
moment-matching or balanced truncation approaches [4]. Note that in such cases 
the method can be implemented as a preprocessing step [5] that allows to obtain 
the essential speedup of more complicated methods. 

The TICER elimination algorithm cannot be applied to nodes with connected 
inductances. Usually the inductances in IC interconnects are neglected and this is 
considered to be insignificant for the interconnect reduction. But the growth of 
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frequencies and/or power of IC increases the contributions of inductances into 
interconnect response which leads to a notable error caused by their neglect. 
Therefore it is desirable to extend the TICER approach to circuits with small 
inductances. 

There are some proposals to take into account nodal inductances in the elimi- 
nation algorithm. The special case of a node with two RL branches and a number of 
capacitances is considered in [6, 7]. In [8] high order admittances are obtained by 
the nodal elimination followed by optimization-based RLC synthesis procedure. 
Such an approach however seems to be too complicated and time-consuming. Paper 
[5] presents the rules for merging any two branches connected to the eliminating 
node. But the rules do not always save first order terms of Laplace admittances 
which results in an insufficient accuracy in the case of small inductances. One 
particular merging rule for LR-branch (at small L) and C-branch is equivalent to the 
inductance neglect. 

The aim of this paper is the development of a first order elimination-based 
algorithm for circuits with small inductances. We propose an approach based on 
the simultaneous elimination of two circuit variables, i.e. nodal voltage and 
inductance current. The proposed approach generates a reduced circuit where the 
nodal inductance is deleted (shorted), and additional capacitances are connected to 
neighboring branches guaranteeing first order accuracy (Sect. 8.3). The circuit 
passivity can be verified by the criterion of the positive definiteness of the 
capacitance matrix. If the eliminated inductance is coupled with other circuit 
inductances (Sect. 8.4) then the reduced circuit contains couplings between 
inductances and capacitances (LC-couplings). It is shown that the couplings cor- 
respond to additional nonzero entries of the capacitance matrix of the Modified 
Nodal Analysis (MNA). This implies that the simulation of the reduced circuit 
with LC-couplings requires small corrections in the simulator code. The rules of 
inductance elimination in the presence of LC-couplings (Sect. 8.5) are also 
derived. The description of the algorithm (Sect. 8.6) and numerical experiments 
(Sect. 8.7) are presented. 



8.2 Elimination of RC-Node by TICER 

The elimination of an RC node (Fig. 8.1) is based on the expression for the star 
connection transform in the Laplace domain (we label branches by the same 
indexes as their outward nodes) 

- _ yiyj _ (g,- + .vc,-)(gj + -^c;) , . 

y-J-^- G +sC ■ ^ ' 

Here y, = g, + scj, Y„ = J^ies X/ ^ ^" + ^^" ^^^ branch and nodal admittances 
respectively, S„ is the set of the indexes of outward nodes of RC-branches con- 
nected to the node n. 
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Fig. 8.1 Circuit transform 
under tlie elimination of an 
RC-node 





If higher order terms of the Laplace variable s in the Taylor expansion of (8.1) 
are truncated then one obtains expressions for conductances and capacitances, 
which must be added to the circuit after node elimination: 



8igj 



giCj + gjCi gil 



G„ 



Gn 



c 



(8.2) 



The method TICER [2] applies (8.2) without the last term in Cy, and in this case 
all capacitances of the reduced circuit are positive. Thus the circuit passivity is 
provided. However, it has been proved in [9] that in spite of possible negative 
capacitances the application of (8.2) does not lead to the loss of circuit passivity. 
Hence, the exact first order expression (8.2) can be applied to the nodal elimination 
to achieve less error. 



8.3 Inductance Elimination 

Here we consider the case of a single inductance connected to an RC node 
(Fig. 8.2). To obtain the reduced circuit we perform simultaneous elimination of 
two circuit variables: nodal voltage and inductance current. 
Let Wi be the current of the /th branch connected to node n 



Wi 



■yi{Vi-V„), i&Sn. 



(8.3) 



The Kirchhoff Current Low (KCL) equation for node n allows us to obtain the 
inductance current as 



Fig. 8.2 Circuit transform 
under inductance elimination 



•"Iff 



W2 



m L„ n 



y2 



ys 

3 




152 M. M. Gourary et al. 

w,„ = ~Y^ Wj = - ^ yj{vj - v„) . (8.4) 

jes„ jeSn 

The inductance voltage is defined by tiie equation 

v„ - v„ ^ sL„w,n. (8.5) 

Substituting (8.4) into (8.5) we obtain 

v„ ( 1 - sL„ Yn ) = v,„ - sLn ^ yjVj . (8.6) 

yes,, 

The nodal variable derived from (8.6) by neglecting higher order terms is 

v« = v„(l + sL„G„) - sL„ ^ gjVj. (8.7) 

jes„ 

After substitution of (8.7) into (8.3), neglecting higher order terms, and col- 
lecting terms in accordance to nodal voltages we obtain 

Wi = .y;(v; - v,„) + sci{vi - v„,) + s^ Cy(v; - v,), (8.8) 

where S'^ is the set S„ without index ;, and 

Ci = -sLngiGn, Cij = sLngigj. (8.9) 

The expression for the ingoing current of the /th node (8.8) corresponds to the 
following reduced circuit (Fig. 8.2): 

(a) node n is eliminated, and the inductance is shorted; 

(b) old RC branches are saved in accordance with the first term of (8.8); 

(c) new negative capacitances c,- (8.9) are connected to each RC branch to define 
the current corresponding to the second term of (8.8); 

(d) new capacitances Cy (8.9) are inserted between outward nodes of RC branches 
to define currents corresponding to the third term of (8.8). 

Unlike the case of RC node elimination the passivity of the circuit after inductance 
elimination is not secured. So it is needed to test a passivity condition before the 
elimination is accepted. It is known that the circuit is passive if its conductance and 
capacitance matrices are nonnegative definite and the capacitance matrix is sym- 
metric. In our case the conductance matrix is kept unchanged and the capacitance 
matrix is symmetric by the construction. Hence, the criterion for a symmetric matrix 
A to be nonnegative definite is applied to the capacitance matrix, i.e. A,, > 0, 
l^i/l ^ Y^A;,-Aj,j. It is equivalent to the conditions on branch and nodal capacitances 



CfE* > 0, |cf *| < ^cfEWcNEW^ (g 10) 



where c-^^* = Cy 
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Fig. 8.3 RLC-section ^ c, = .u^ 

reducing 




r ^ 



If (8.10) is not valid for the reduced circuit then the elimination must be 
rejected. 

If the inductance L„ has a number of couplings with additional circuit iduc- 
tances then each coupling must be processed in the same way. 

A simple example of an inductance elimination is presented in Fig. 8.3. Con- 
ditions (8.10) for the reduced circuit yield c > 2L/r^. 



8.4 Elimination of Coupled Inductances 

Here we consider the case that the inductance L„ is coupled with some other 
inductance L^.. This coupling is defined by the mutual inductance L„<. (Fig. 8.4). 
In this case (8.3, 8.4) are valid, (8.5) is replaced by 

V„ - V„ = SL„W,„ + sLnkWk (8.11) 

and the equation for the voltage of inductance L^ must be added: 

Uk = sLkWk + sLnkWm- (8-12) 

Then the substitution of (8.4) into (8.11) leads to 

Vn = V,„(l + sLnGn) - sL,,^ gjVj + sLnkWk + 0{s^). (8.13) 

Substituting (8.13) into (8.3) we obtain a first order expression for the nodal 
current containing an additional term in comparison to (8.8) 

Wi = Wi + sOi^i^^)kWk, (8-14) 

where vv, is the right hand side of (8.8), 

G(ijn)k = Lnkgi- (8.15) 

The substitution of (8.4) into (8.12) leads to an expression for the inductance 
voltage containing a similar term: 

Uk = sLkWk + s'^Oijj„)k(Vj - v,n). (8.16) 
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Fig. 8.4 Circuit transform by the elimination of the coupled inductance 



The parameter Oy,„)j^ in (8.14, 8.16) corresponds to the coupling (Fig. 8.4) 
between RC- and L-branches (LC-coupling) that can be defined by the following 
time-domain expressions for the capacitance current (/c) and the inductance 
voltage (mz,): 



ic = C 



duc 
~di~ 







dii 
~dt' 



Ul 



dJL 

' dt 







duc 
dt 



.17) 



Our notation 0(, ,„)^- refers to /cth inductance coupled with the capacitance 
between nodes m, i. As far as we know there are no physical devices with such 
interaction. Formally it can be defined in the context of MNA approach as follows. 

The MNA conductance and capacitance matrices of the RLCK circuit are 



-.MNA 



G / 

~f 



c 



MNA 



C 

©^ L 



.18) 



Here, submatrix / is the incidence matrix for the inductances. Submatrix is 
the zero matrix for physically realizable RLCK circuits. With the Gaussian 
elimination the entries of are replaced by nonzero values. Such nonzero values 
can be defined by LC-couplings 



©rt -^0(,j)i, 0(,j)t 



-9 



ij,i)k- 



.19) 



jes, 



It can be seen from Fig. 8.4 and (8.14), (8.16) that the coupling of an eliminated 
inductance is replaced by LC-couplings with each RC-branch of the eliminated 



node. Due to the symmetry of C 
form 



' the passivity condition can be presented in the 



|0;*|<y^, \&jk\<y/^Cj- 



(8.20) 



Standard SPICE-like simulators cannot process LC-couplings. Note that this 
capability can be easily implemented taking into account the corresponding time- 
domain expressions (8.17) and the MNA capacitance matrix (8.18). 
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8.5 Eliminations Under LC Couplings 

The possible appearence of LC-couplings due to previous eliminations requires to 
analyze the case when the inductance to be eliminated has LC-coupling with any 
RC-branch (Fig. 8.5). 

In this case, (8.5) must be replaced by 

V„~V„ = sL„W,„ + s6(kS)n{vk - V;). (8-21) 

After transformations similar to (8.6-8.8) we obtain 

Wi = Wi + SgiO(u)n{vk - V,) = -iVi + SgiO(kJ),„{vk - Vi) + s{~giO(u)„){vi - V,-). 



..22) 



The current entering node m is defined as 

Wm = ~^Wj = -^Wj + s{-~GnO(k,l)n){vk - Vm) + sGnO(kJ)n{vi - V,„). 

,/6S„ jes„ 



.23) 



The last terms in (8.22, 8.23) represent the currents through the capacitances 

Cki = —Cn ~ gi9{k.l)n, Qm — —Ckm — GnO(k.l)n- (8.24) 

Thus, after elimination of the inductance with LC-coupling the reduced circuit 
contains additional capacitances (8.24) as shown in Fig. 8.5. A special case of an 
elimination can be considered when the inductance is LC-coupled to one (Alh) of 
its adjacent RC-branches. The resulting circuit contains capacitances 



Cki = -C; = giO(k.n)n (i € S'l,), Ck = {gk - Gn)0(k,n)n- 



(8.25) 



These capacitances do not form additional entries of the capacitance matrix 
because shunt capacitances (8.9) are always created under inductance elimination. 




Fig. 8.5 Circuit transform by the elimination of the LC-coupled inductance 
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Fig. 8.6 Successive inductance eliminations transform inductively coupled branches into 
capacitively coupled branches. Here 9 = L^/ri, Ci = L\jr\, C2 = I-,2lr\, Cm = Li2/rir2 



Note that if the circuit contains two coupled inductances then after their 
elimination the reduced circuit contains RC branches only. If all inductances of the 
circuit are eliminated then the reduced RLCK circuit is an RC circuit. An example 
of a reduction of two coupled LR branches by elimination of both inductances is 
presented in Fig. 8.6. 

In the analysis presented above we ignored the nodes with multiple inductances. 
But this case does not require a special analysis because the presented algorithms 
are sufficient to provide the elimination of all small inductances. This can be 
explained in the following way. Since we do not consider DC undefined circuits 
containing inductance loops each connected graph of inductances is a tree that 
always has leaf nodes with only one inductance. Therefore if all inductances of 
that tree are sufficiently small the single-inductance algorithms will sequentially 
eliminate them. 

LC-couplings must be taken into account in RC-node elimination (8.1, 8.2). 

If the mth branch of node n which is to be eliminated is LC-coupled with the /rth 
inductance (Fig. 8.7, m = 1), then the branch current is 

Wm = ymiVm " V„) + sO^„j,)kWk. (8.26) 

The currents of the other RC-branches are defined by (8.3), and the KCL for 
node n is 

sO{„,„)kWk + ^ yj{vj - v„) = 0. (8.27) 

jes„ 

Substituting v„ derived from (8.27) into (8.2, 8.26), and neglecting higher order 
terms, we obtain 

Wi^'^{gij + scij){vi-Vj)-s0^„_„-fk-^Wk, j&S", (8.28) 

^m = ^{gmj + SC„y){v„, - Vj) + sO^„,j,)k " " wk- (8.29) 

jes';: '-'" 

Here gjj, Cjj are defined by (8.2), and the second right hand terms (8.28, 8.29) 
correspond to LC-couplings in the reduced circuit (Fig. 8.7): 
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Fig. 8.7 Circuit transform by node elimination under LC-coupling 



^{m.i)k ^ {m.n)k 



G„ 



(8.30) 



Thus after node elimination the branch coupling parameter is distributed among 
new branches adjacent to node m in proportion to their conductances. 



8.6 Algorithmic Aspects 



Presented above considerations allow us to extend the TICER reduction algorithm 
to include nodes with small inductances. The extention requires to adapt important 
details of the overall TICER algorithm for new types of eliminated nodes. First of 
all it is needed to provide an error control in the reduction algorithm. 

The error due to the first order elimination is defined by the neglected higher 
order terms. For node elimination (8.2) one can evaluate the mth order term as 
^,^"' = 0{\s%X) [3], where t„ = t^*^ = C„/G„ is the RC time-constant of the Mth 
node. In TICER nodes to be eliminated are chosen by the condition t„ <Tmin [3], 
where !„,;„ is evaluated in accordance to the maximum operating frequency of 
interest 1^^^ — ci If max- Factor a is defined by the trade-off between the accuracy 
and the circuit reduction ratio. 

To include the inductance elimination in the general algorithm it is necessary to 
define a nodal time-constant for the nodes with connected inductance. It can be 
shown that mth order terms neglected in the inductance elimination (8.7, 8.8) are 
evaluated as (5;"' = 0(|iT*''| I^'t^^I ), {k + I — m), where zf,'' is defined earlier, 
and T^^ = L„G„ (RL time-constant). Thus, the nodal time-constant in this case can 
be evaluated as the worst-case value 



T„ = max(C,C) 



(8.31) 
for an RC 



This expression can be applied to each node if we assume t„ 
node. 

Another important aspect of TICER that can be extended for inductance 
eliminations is the restriction of the density of the reduced system. It is achieved 
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by introducing user-supplied maximal nodal degree. The nodal degree is defined as 
the number of incident resistors (A^^) that provides no more than A^^^^^ = 
Nr{Nr — 1) new entries in the conductance matrix. Pure capacitance branches are 
not considered. We think that it is desirable to take into account the entries in the 
superposition of conductance and capacitance matrices because simulations both 
in time and frequency domains require linear combination of the matrices. In this 
case the maximal number of new entries is evaluated by N^^^ — A^rc(^«c — !)> 
where Nrc is the number of incident RC-branches. 

The inductance elimination generates new entries (Fig. 8.2) due to capacitances 
Cij (8.9). The entry is not produced for branches with zero conductances, hence the 
maximal number of new entries is AO;(A^r ~ !)■ Besides INj^ new entries are 
produced by each inductance coupled with the eliminated one (Fig. 8.4) in 
accordance with (8.15). The factor 2 appears because LC-coupling 0(ij)k corre- 
sponds with two entries 0,i, 0j,t (8.19). Each LC-coupling (Fig. 8.5) produces 
2(A^^ + 1) new entries due to (8.24). Thus the number of new entries after 
inductance elimination can be evaluated as 

<ew = ^«(^LL + 1) + 2Nlc{Nrc + 1). (8.32) 

Here A^^^ is the number of inductance couplings of eliminated inductance, Ni^c 
is the number of the LC couplings not counting the couplings with incident 
branches. 

In the general case incident branches of an RC node can posess LC couplings 
with circuit inductances (Fig. 8.7). All couplings with one inductance produce A^^ 
new entries in the matrix 0. Thus the total number of new entries after RC-node 
elimination is 

<ew = Nrc{Nrc - 1) + A^rA^lc, (8.33) 

where N^c is the number of inductances coupled with any RC-branch of elimi- 
nated node. 

The TICER algorithm [3] extended to include inductance eliminations contains 
the following operations. The sorted queue of internal nodes with no more than 
one connected inductance is supported after each elimination step. The fields of the 
sorting key are the number of new entries (8.32, 8.33) and nodal time constant 
(8.31). Maximal values of the keys are user-defined data. 

The elimination is performed for the first node in the queue by applying either 
(8.2, 8.30) for an RC-node or (8.9, 8.15, 8.24, 8.25) for the node with inductance. 
If passivity conditions (8.10, 8.20) are not satisfied then the elimination is ignored 
and the next node in the queue is processed. The algorithm stops if the queue is 
empty or all nodes in the queue do not satisfy passivity conditions. 

Experiments show that computational efforts of inductance elimination are 
typically 2-4 times greater than efforts of RC-node (TICER) elimination. This is 
resulted from more complicated analysis of the circuit structure and the need of 
passivity verification. 
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Two circuit examples are presented below to illustrate the proposed inductance 
elimination algorithms. 

The first example is an RLC line containing ten identical sections presented in 
Fig. 8.3 with parameters r = 1 Q, L = 0.4pH, C = 1 pF. Note that inductance 
value is slightly smaller than the maximum value guaranteeing passivity 
(L<0.5pH). 

Two reduced circuits were obtained: 

• RC line using the elimination of all inductances by the proposed method; 

• RC line using the neglect of all inductances. 

The frequency- and time-domain response of the original RLC line as well as 
two reduced circuits are computed. The time-domain simulation was performed 
under a unit step excitation with fnse = 3 psec. 

The reduction error for each method was evaluated as the difference between 
the responses of the original and reduced circuits (Fig. 8.8). One can see that the 
elimination error is about tenfold less than the neglect error both in frequency 
(Fig. 8.8a) and time-domain (Fig. 8.8b). 

The second example presents inductively coupled branches shown on the left of 
Fig. 8.6 with parameters r^ — r2 — 5Q., Li = L2 = 1 pH, L12 = 0.5 pH. Fur- 
thermore, load capacitances IpF connected to nodes 2, 4 are added. Nodes 1 and 4 
are assumed as input and output ports respectively. The reduced circuit is obtained 
in the form shown on the right of Fig. 8.6. Output frequency responses are shown 
in Fig. 8.9a. Time-domain crosstalk waveforms under an input pulse with unit 
magnitude and fpuise — 20psec,rrise — lpsec,ffaii ~ 4 psec are given in Fig. 8.9b. 
Note that in spite of large errors in the frequency response of the reduced circuit at 
/> 100 GHz the error in the time domain crosstalk waveform is sufficiently small 
because time-constants of eliminated nodes Li/ri — L2/r2 — 0.2 psec are essen- 
tially less then t^ise and ffan. 



(a) 

ODoe 

0.007 
0.006 
0.005 
0.004 
0.003 
0.002 
0.001 



(b) 





> 








7 








J 








t 








7 






.._ ./ 


7 


/-j\ _\ 




^ 




w 


K,^^ 



1e<09 1e»10 

frequency (Hz) 



0.001 

-0.001 
4.002 
-0.003 
-0.004 
-0.00$ 
-0.006 



A® 


^ 




— 


1 


/ : 








/ 




/ 






/ 






/ 








\j® 









1e-10 1.5e-10 
time (sec) 



2e-10 2.5e-1 



Fig. 8.8 RLC-line reduction errors due to the elimination (1) and the neglect (2) 
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Fig. 8.9 Crosstalk frequency (a) and time-domain (b) responses in inductively coupled branches 
(1) and in the reduced circuit (2) 



8.8 Conclusion 

A new class of elimination algorithms for circuits with small inductances is pre- 
sented in the paper. The proposed algorithms provide a reduction of an arbitrary 
RLCK circuit preserving first order accuracy of the circuit transfer functions. 
A passivity condition for the reduced circuit is given, which can be easily verified 
at each elimination step. Minor corrections in circuit simulators are required to 
perform the simulation of the reduced circuits. This work was supported by RFBR 
under grant 08-07-0017 la. 
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Chapter 9 

Simulation of Coupled Oscillators Using 
Nonlinear Phase Macromodels and Model 
Order Reduction 

Davit Harutyunyan and Joost Rommes 



Abstract Oscillators are used in many integrated RF circuits. Since their behavior 
is highly nonlinear, full system simulation can be expensive. Furthermore, the 
behavior of an oscillator can be (un)intendedly perturbed by that of other com- 
ponents and oscillators. We apply a method to build nonlinear phase macromodels 
of voltage controlled oscillators and show how these can be used to predict the 
behavior of oscillators under perturbation. Model order reduction techniques are 
used to decrease simulation times. Numerical results for realistic design illustrate 
the proposed approach. 

Keywords Behavioral modeling • Circuit simulation • Injection locking • Phase 
noise • Pulling • Voltage-Controlled Oscillators (VCOs) 



9.1 Introduction 

The request for more functionality on a smaller physical area makes the design 
of modern RF (radio frequency) integrated circuits increasingly more compli- 
cated. Modern RF chips for mobile devices, for instance, typically have an FM 
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radio, Blue-tooth, and GPS on one chip. Each of these functionaUties are 
implemented with Voltage Controlled Oscillators (VCOs), that are designed to 
oscillate at certain different frequencies. Such oscillators are influenced by 
unintended (parasitic) signals coming from other blocks (such as Power 
Amplifiers) or from other oscillators, via for instance (unintended) inductive 
coupling through the substrate. A possibly undesired consequence of the 
perturbation is that the oscillators lock to a frequency different than designed 
for, or show pulling, in which case the oscillators are perturbed from their free 
running orbit without locking. This makes floor planning, i.e., determining the 
locations for the functional blocks, one of the most challenging tasks in RF 
chip design. 

Our motivation comes from the design of RF systems, where oscillators play an 
important role [4, 8, 12, 21] in, for instance, high-frequency phase locked loops 
(PLLs). The nonlinear dynamics of interest include changes in the frequency 
spectrum of the oscillator due to small noise signals (an effect known as jitter [8]), 
which may lead to pulling or locking of the oscillator to a different frequency and 
may cause the oscillator to malfunction. Since both phase and amplitude dynamics 
are strongly nonlinear and spread over separated time scales [17], simulation is 
difficult. Accurate simulation requires very small time steps during time integra- 
tion, resulting in unacceptable simulation times that block the design flow. Fur- 
thermore, transient simulation only gives limited understanding of the causes and 
mechanisms of the pulling and locking effects. Oscillators appear in many other 
physical systems and applications, see e.g. [2, 18]. 

Here we use the nonlinear phase macromodel introduced and developed in 
[8, 10, 11, 16, 17, 25]. Contrary to linear macromodels [1, 16, 21], the nonlinear 
phase macromodel is able to capture nonlinear effects such as injection locking. 
Because the macromodel replaces the original oscillator system by a single scalar 
equation, simulation times are decreased while the nonlinear oscillator effects can 
still be studied without loss of accuracy. We use the macromodels to predict the 
behavior of inductively coupled oscillators. 

In some applications one exploits the coupling of oscillators. To reduce 
clockskew (clocksignals becoming out of phase), for instance, oscillators can be 
coupled via transmission lines [9]. Since accurate models for transmission lines 
can be large, this may lead to increased simulation times. We show how model 
order reduction techniques [3, 5, 22] can be used to decrease simulation times 
without unacceptable loss of accuracy. 

The paper is organized as follows. Section 9.2 gives a summary of the phase 
noise theory. In Sect. 9.3 we show how the phase noise theory can be used to 
analyze oscillator-balun coupling. In Sect. 9.4, we explain the coupling of oscil- 
lators via transmission lines. Application of model order reduction techniques in 
simulation of coupled oscillators is described in Sect. 9.5 Numerical results are 
presented in Sect. 9.6 and Sect. 9.7 concludes. 
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9.2 Phase Noise Analysis of Oscillators 

A general free-running oscillator is described as an autonomous system of dif- 
ferential equations: 

'^+;W=0, (9.1a) 

x(0)=x(r), (9.1b) 

where x{t) £ M" are the state variables, T is the period of the free running oscil- 
lator, which is in general unknown, q,j : M." — ^ R" are (nonlinear) functions and 
n is the system size. The solution of (9.1a, b) is called periodic steady state (PSS) 
and is denoted by Xp„. Although finding a PSS solution can be a challenging task 
in itself, we will not discuss this in the present paper and refer to, for example, 
[6, 10, 13-15, 23]. 

A general oscillator under perturbation can be expressed as a system of dif- 
ferential equations 

^+Xx)=MO, (9.2) 

where b{t) £ M" are perturbations to the free running oscillator. For small per- 
turbations b(t) it can be shown [8] that the solution of (9.2) can be approximated by 

Xp{t)^Xp,,{t + ci{t)), (9.3) 

where a(f) G M is called the phase shift, which satisfies the following scalar 
nonlinear differential equation: 

di{t)^V'^{t + oi{t))-b{t), (9.4a) 

a(0) = 0, (9.4b) 

with y(f) e W being the perturbation projection vector (PPV) [8] of (9.2). The 
PPV is a periodic function with the same period as the oscillator and can efficiently 
be computed directly from the PPS solution, see e.g. [7]. 



9.3 Oscillator Coupled to a Balun 

In this section we consider the mathematical model of an oscillator inductively 
coupled to a balun. A balun is an electrical transformer that can transform balanced 
signals to unbalanced signals and vice versa. A schematic view of an LC oscillator 
coupled to a balun with mutual inductors is given in Fig. 9.1. The corresponding 
mathematical model is given by the following set of equations: 
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Fig. 9.1 Oscillator coupled 
with a balun 
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v, secondary balun 




Ct'^ + ^ + .W+5tanh(fv,w)=0, 



df df 

dv2(f) , V2(r) 



df 



C2 



dr +^+'2W+/(0=0, 
d/i(r) d;3(f) 

Mi2^^ + M23^^ - V2(f) = 0, 



df 

dV3(f) V3(f) 



df 



C3^ + ^+/3W = 0, 
L3^ + M,3^ + M23^-V3(f)=0, 



dt 



df 



df 



(9.5a) 

(9.5b) 
(9.5c) 
(9.5d) 
(9.5e) 
(9.5f) 



where My — kjjy/LiLj, ij — 1,2,3, i<j is the mutual inductance and kij is the 

coupling factor. The parameters of the nonlinear resistor are 5 — l/Ri and G„ = 

— l.l/Ri and the current injection in the primary balun is denoted by /(f). 

d/2(f) d/3(f) 

For small coupling factors we can consider M12 h M13 in (9.5b) as a 

df df 

small perturbation to the oscillator and apply the phase shift macromodel to 

(9.5a)-(9.5b). Then the reduced model corresponding to (9.5a)-(9.5b) is 



da(t) 
df 



y'c + «(0) • ( _M,2^- M,,^ ' ^9.6) 



df 



df 



where V is the PPV of the oscillator. The balun is described by a linear circuit 
(9.5c)-(9.5f) which can be written in a more compact form: 
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dx(t) 
dt 



-^Ax{t) +Bu(t), 
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(9.7) 



(9.8a) 



(9.8b) 



With these notations (9.6) and (9.7) can be written in the following form 

f-V^it+m{£,) (9.9a) 

A<0 



dt 



--Ax{t) +Bu{t), 

y{t) = ex, 



(9.9b) 
(9.9c) 



where 6^ = (0, -Mi2,0, -M13) and ii(t) is computed by using (9.3). 



9.4 Oscillator Coupling to a Transmission Line 

In some applications oscillators are coupled via transmission lines. By coupling 
oscillators via transmission lines, for instance, one can reduce the clockskew in 
clock distribution networks [9]. Accurate models for transmission lines may 
contain up to thousands or millions of RLC components [26]. Furthermore, the 
oscillators or the components that perturb (couple to) the oscillators can consist of 
many RLC components, for instance when ones takes into account parasitic 
effects. Since simulation times usually increase with the number of elements, one 
would like to limit the number of (parasitic) components as much as possible, 
without losing accuracy. 

For the sake of simplicity in this paper we consider an RC transmission line 
coupled to an oscillator, see Fig. 9.2. Using phase macromodel for oscillator and 
by applying KirchhoflF's current law to the transmission line circuit, we obtain the 
following set of differential equations: 



da(r) 
df 



y^(/+a(f)) 



.v(0-v(0 





(9.10a) 
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Fig. 9.2 Oscillator coupled to a transmission line 
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Fig. 9.3 Two oscillators coupled via a transmission line 

ck(r) 



df 



:Ax{t) +Bu{t), 

y(t) = ex, 



where 



E = diag(Ci, C2, . . ., C„), A = tridiag( — , -— - - — ,- — 



5 = 



' 



Vo 1/ 



x(0 = 



V2(f) 



Vn{t)) 



, M(r) 



v(0 



, e = 



(l\ 



Vo/ 



(9.10b) 
(9.10c) 

(9.11a) 
(9.11b) 



In a similar way the phase macromodel of two oscillators coupled via a 
transmission line, see Fig. 9.3, is given by the following equations: 



dai(f) 
df 



yf(r+ai(f)) 







E—^^Ax{t)+Bu{t), 



(9.12a) 
(9.12b) 
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df 



V^it + 0i2{t)) 



''..(0-t'o(0 

Rn+l 





(9.12c) 



where ai(t) and 01.2(1) (Vi and 1^2) are phase shifts (PPV's) of the corresponding 
oscillator. The matrices E, A and x are given by (9.11a, b) and 



B 



(i M 




vo ^y 



, «(?) 



v(f) 
vo(f) 



(9.13) 



9.5 Model Order Reduction 

Model order reduction (MOR) techniques [3, 5, 22] can be used to reduce the 
number of elements significantly. Here we show how model order reduction can be 
used for the analysis of oscillator perturbation effects as well. Since the main focus 
is to show how MOR techniques can be used (and not which technique is the most 
suitable), we limit the discussion here to balanced truncation [20]. For other 
methods, see, e.g., [3, 5, 22]. 

Given any dynamical system (A, B, C) 



^ ck(f) 
dt 



Ajc{t) + Bu{t), y{t) = C^x, 



and assuming E — I, balanced truncation [20] consists of first computing a bal- 
ancing transformation V G M."^", where B and C are input- to- state and state- 
to-output vectors, respectively. The balanced system {V^AV,V^B,V^C) has the 
nice property that the Hankel singular values' are easily available. A reduced order 
model can be constructed by selecting the columns of V that correspond to the 
k < n largest Hankel singular values. With Vk € W^'' having as columns these 
k columns, the reduced order model (of order k) becomes {V^AVk, V^B, V^C). If 
E j^ I is nonsingular, balanced truncation can be applied to {E^^A,E^^B, C). For 
more details on balanced truncation, see [5, 20, 22]. 

In this paper we apply model order reduction to linear circuits that are coupled 
to oscillators, and the relevant equations for each problem describing linear cir- 
cuits have the form of (9.9b)-(9.9c). For each problem the corresponding matrices 
E, A, B, and C can be identified readily, see (9.8a, b), (9.11a, b), (9.13) and note 
C = C. We use Matlab [19] implementation for balanced truncation to obtain 
reduced order models: 



Similar to singular values of matrices, the Hankel singular values and corresponding vectors 
can be used to identify the dominant subspaces of the system's statespace: the larger the Hankel 
singular value, the more dominant. 
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sys = ss( -E\A, -E\B, C, ) ; 

[hsv, baldata] = hsvd(sys); % Hankel singular values 

mor_diTn = nnz ( (hsv>le-10) ) ; % choose largest singular 
% values where mor_dim is 
% the dimension of the reduced system 

rsys = balred (sys , mor_dim, 'Elimination' , 'Truncate' , . . . 

' Balancing' , baldata) ; %truncate 

Note that we can apply balanced truncation because E is nonsingular. It is well 
known that in many cases in circuit simulation the system is a descriptor system 
and hence E is singular. Although generalizations of balanced truncation to 
descriptor systems exist [22, 24], other MOR techniques such as Krylov subspace 
methods and modal approximation might be more appropriate. We refer the reader 
to [3, 5, 22] for a good introduction to such techniques and MOR in general. 



9.6 Numerical Experiments 

In all the numerical experiments the simulations are run until Tunai = 6 x 10^^ s 
with fixed time step x — 10~". In this section all the numerical results done with 
the phase macromodel combined with MOR technique are called macromodel- 
MOR simulation. 

We compare our results with simulation results of the full circuit (no macro- 
modeling, no model order reduction), hereafter full-simulation. Because the full 
circuit represents a stiff ODE, we use the Matlab built-in ODE solver odelSs with 
relative tolerance set to 10~^ (larger values of relative tolerance result in incorrect 
solutions) to achieve a comparable accuracy with the macromodel-MOR simula- 
tion results. In all experiments we observed that the simulation time of the mac- 
romodel-MOR technique is typically five times faster than full-simulation times. 



9.6.1 Oscillator Coupled to a Balun 

Consider an oscillator coupled to a balun as shown in Fig. 9.1 with the following 
parameters values: 

Oscillator Primary balun Secondaiy balun 

Li =0.64x lO-'H L2 = l.lOx lO-'H L3 = 3.60 x IQ-'H 

d = 1.71 X lO-'^F C2 = 4.00x lO-'-F C3 = 1.22 x lO^'^F 

Ri=50Q R2=40n R2 = 60n 
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Fig. 9.4 Comparison of the output spectrum of the oscillator coupled to a balun obtained by the 
macromodel-full and the macromodel-MOR simulations for an increasing injected current 
amplitude A\^^ and an offset frequency /off = 20 MHz 



The coefficients of tlie mutual inductive couplings are kyi — 10 ^^,^13 — 5.96x 
10"^, ^23 — 9.33 X 10"^. The injected current in the primary balun is of the form 



^(f) = ^inj sin(27t(/o -/off)0, 



(9.14) 



where /o = 4.8 GHz is the oscillator's free running frequency, /off is the offset 
frequency and Ainj is the current amplitude. 

Simulation results of (9.9a, b, c) are shown in Fig. 9.4, where the frequency is 
plotted versus power spectral density (PSD^). The obtained results by the mac- 
romodel-MOR technique with mor_dim = 2 provide a good approximation to the 
full-simulation results. We note that for the injected current with Ainj = 10"' A the 
oscillator is locked to the injected signal. Similar results are also obtained for 
the balun. 



Matlab code for plotting the PSD is given in [12]. 
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Fig. 9.5 Comparison of the output spectrum around the first and third harmonics of the oscillator 
coupled to a transmission line, cf. Fig. 9.2. a First harmonic, b Third harmonic 

9.6.2 Oscillators Coupled with Transmission Lines 

In tliis section we consider two academic examples, wfiere transmission lines are 
modeled with RC components. 



Single Oscillator Coupled to a Transmission Line 

Let us consider the same oscillator as given in the previous section, now coupled to a 
transmission line, see Fig. 9.2. The size of the transmission line is « = 100 with the 
following parameters: Ci = • • • = C„ = 10"^ pF,^i = 40kQ,/?2 = ■ ■ = Rn = 
1 Q. The injected current has the form (9.14) with Ai„j = 10"^ A and /off = 20 MHz. 
Dimension of the reduced system is mor_dim = 18. Simulation results of (9.10a, 
b, c) around the first and third harmonics (this oscillator does not have a second 
harmonic) are shown in Fig. 9.5. The macromodel-MOR method, using techniques 
described in Sect. 9.5, gives a good approximation to the full simulation results. 



Two LC Oscillators Coupled via a Transmission Line 



For this experiment we consider two LC oscillators coupled via a transmission line 
with the mathematical model given by (9.12a, b, c). The first oscillator has a free 
running frequency /i = 4.8 GHz and is described in Sect. 9.6.1. The second LC 
oscillator has the following parameter values: Rq = 50Cl,Lo — 0.64nH, 
C() — 1.87 pF and a free running frequency /2 = 4.6 GHz. The size of the trans- 
mission line is n = 100 with the following parameters: Ci = • • • = C„ = 
lO^^pp^^j = /^^^j = 4kQ,/?2 = • • • = ^n = 0.001 Q. Dimension of the reduced 
system is mor_dim = 16. Numerical simulation results are given in Fig. 9.6. As 
can be seen from the spectra of Fig. 9.6, the macromodel-MOR and full simulation 
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Fig. 9.6 Comparison of the output spectrum around the first and third harmonics of two 
oscillators coupled via a transmission line, f^ = 4.6 GHz : a first harmonic, b third harmonic; 
/i = 4.8 GHz : c first harmonic, d third harmonic 



results match well: with both simulation methods the carrier and beat frequencies 
are the same with a comparable PSD. 



9.7 Conclusion 



In this paper we have used nonlinear phase macromodels to accurately predict the 
behavior of individual and mutually coupled voltage controlled oscillators under 
perturbation. Several types of coupling have been described, including oscillator- 
balun coupling. For small perturbations, the nonlinear phase macromodels can also 
be used to produce results with accuracy comparable to full circuit simulations [11]. 
In addition, model order reduction techniques have been applied to transmission 
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lines that couple oscillators. With these techniques, reduced-order models could be 
obtained that decreased simulation times while preserving the required accuracy. 
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Chapter 10 

POD Model Order Reduction of 
Drift-Diffusion Equations in 
Electrical Networks 

Michael Hinze, Martin Kunkel and Morten Vierling 



Abstract We consider integrated circuits witli semiconductors modelled by 
modified nodal analysis and ID drift-diffusion equations. The drift-diffusion 
equations are discretized in space using finite element methods. The space dis- 
cretization yields a high dimensional differential-algebraic equation. We show 
how POD methods can be used to reduce the dimension of the model. We compare 
reduced and fine models and give numerical results for a basic network with one 
diode. Furthermore we discuss an adaptive approach to construct POD models 
which are valid over certain parameter ranges. Finally, numerical investigations 
for the reduction of a 4-diode rectifier network are presented, which clearly 
indicate that POD model reduction delivers surrogate models for the diodes 
involved, which depend on the position of the semiconductor in the network. 
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Fig. 10.1 Basic test circuit with one diode. The network is described by 

Ar = {0, l)\ 

8{Ale,t) = -e2(t) 



10.1 Introduction 

In this article we investigate a POD-based model order reduction for semicon- 
ductors in electrical networks. Electrical networks can be efficiently modelled by a 
differential-algebraic equation (DAE) which is obtained from modified nodal 
analysis. Denoting by e the node potentials and by ji and yV the currents of 
inductive and voltage source branches, the DAE reads (see [9, 19]) 



Ac — qciAle,t) +Afig(Aje,f) +AJl+AvJv = -A,U{t), 



dt 



b^(jL,t)~Ale = 0, 



Aye 



V..W- 



(10.1) 

(10.2) 
(10.3) 



Here, the incidence matrix A = [Ar,Ac,Al,Av,Aj] represents the network 
topology and qc, g and (f)j^ are continuously differentiable functions defining the 
voltage-current relations of the network components. The continuous functions Vj 
and is are the voltage and current sources. For example consider the network in 
Fig. 10.1. Under the assumption that the Jacobians 



Dc{e,t) 



^qc 



(e,f), DG{e,t) 



8e 



(e,f), Dz. (/-,?) 



8^ 
9y 



-(/■,f) 



are positive definite, analytical properties (e.g. the index) of DAE (10.1)-(10.3) are 
investigated in [5] and [6]. In linear networks, the matrices Dq, Dq and £)^ are 
positive definite diagonal matrices with capacitances, conductivities and induc- 
tances on the diagonal. 

Often semiconductors themselves are modelled by electrical networks. These 
models are stored in a library and are stamped into the surrounding network in 
order to create a complete model of the integrated circuit. Here we use a different 
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approach which uses the transient drift-diffusion equations as a continuous model 
for semiconductors. Advantages are the higher accuracy of the model and fewer 
model parameters. On the other hand, numerical simulations are more expensive. 
For a comprehensive overview of the drift-diffusion equations we refer to [1, 2, 4, 
11, 15]. Using the notation introduced there, we have the following system of 
partial differential equations for the electrostatic potential i/'(f,x), the electron and 
hole concentrations n{t,x) said p{t,x) and the current densities Jn{t,x) and Jp{t,x): 

div(£ giadij/) — q{n — p — C), 
-qd,n + div/„ = qR{n,p,J„,J,,), 
qd,p + divJp = -qR{n,p,J„,Jp), 

J„ = H„q{UT gradn - n gradi//), 

with {t,x) € [0, T] X Q. and Q C W'. The nonlinear function R describes the rate of 
electron/hole recombination, q is the elementary charge, £ the dielectricity, /,(„ and 
fip are the mobilities of electrons and holes. The temperature is assumed to be 
constant which leads to a constant thermal voltage Uj- The function C is the time 
independent doping profile. Note that we do not formulate into quasi-Fermi 
potentials since the additional non-linearities would imply higher online simula- 
tion time for the reduced model. 

We specify boundary conditions for Ohmic contacts Fq C F :— [0,T] x 8Q 
with normal vector v: 



Yc(x) V 4«2 + C(x)^ 



iP{t,x) = iPhi{x)+e{x,t)^UTlog\^ :^ +e{x,t), (10.4) 



n{t,x) - ^ Uc{xf + 4«2 + C(x) j , 
(^^Cixf + 4nj-C{x)y, 



(10.5) 



p{t,x) - i ( \/C{xY + 4nj - C{x) ] , (10.6) 

for {t,x) e Fq- Here, e denotes the applied potential at the interface boundaries, 
4>hi{^) is the build-in potential and «, the constant intrinsic concentration. 
At isolated boundaries F/ C F it holds grad ij/ ■ v = 0, J„ ■ v = and Jp ■ v = 0. 

The coupling between the drift-diffusion equations and the electrical network is 
established by the applied potential e(x, t) at the interfaces and the current through 
the semiconductor which can be calculated as 

js = / {J„ + Jp - £8( gradi/^) • vda, (10.7) 

To 

e.g. the current is the integral over the current density J„ + Jp plus the displace- 
ment current. The complete model forms a partial differential-algebraic equation 
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(PDAE). The analytical and numerical analysis of such systems is subject to 
current research, see [3, 8, 17, 19]. 

This paper is organized as follows. In Sect. 10.2 we present the model for the 
complete coupled system, meaning the network including semiconductors. The 
coupled model then is simulated using finite-element methods in Sect. 10.3. This 
gives us so-called snapshots yt := y{ti), i — 1, . . .,k, which represent the state of 
the circuit and the semiconductors at time f,. Based on these snapshots and POD 
we construct a reduced model in Sect. 10.4. A numerical investigation of the 
model is presented in Sect. 10.5 where also advantages and shortcomings of our 
approach are discussed. 



10.2 Complete Coupled System 

In the present section we develop the complete system in ID. The «, semicon- 
ductors are diodes with length L^, k = 1, . . .,«,. For the sake of simplicity we 
assume that the contacts are Ohmic, and that the dielectricities £^ are constant over 
the whole domain Q := [0,Li]. Furthermore we focus on the Shockley-Read-Hall 
recombination 

R{n,p) - ' • 



which does not depend on the current densities. x„ and tj, are the average lifetimes 
of electrons and holes. 

The simulation of the complete coupled system is expensive and numerically 
difficult due to bad scaling of the drift-diffusion equations. The numerical issues 
can be significantly reduced by the unit scaling procedure discussed in [15], e.g. by 
substituting 

x = n:, i^ = UTh « = llc|L«, P = \\c\\^P, c=|ic|Lc, 

_ qUT\\C\\^ ~ ^ qUrWCl^ ~ „_,^||^|| 

Jn — , ni-'rij Jp — J h'-p-'pj "/■ "/ll'-'lloc- 

The scaled complete coupled system is constructed as follows. (We neglect the 
tilde-sign over the scaled variables.) The current through the diodes must be 
considered in Kirchhoff's current law. Consequently, the term Asjs is added to 
Eq. 10.1, e.g. 

Ac-qciAj.e,t) +ARg{Ale,t)+AuL+Avjv +Asjs ^ -A,i,{t), (10.8) 

^^^^{j,,t)-Ale = Q, (10.9) 
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Ale^v,{t). (10.10) 

The voltage-current relation for the semiconductor k is established by the 
couplings 



-^^4^(/^„/«(M) + V.(M))-o^'- 



;„(r) = a'L-nLl^ {^j^i^t, 1) + V,(r, 1)) -a-^d,Mt, 1), (lo.n) 



^(f,0)=^, (10.12) 

^(,j)^r^ii^ SA^^ (10.13) 

Uj 

which is basically the evaluation of Eqs. 10.4 and 10.7. The left boundary is chosen 
as reference terminal, a denotes the area size of the contact of the diode. In order to 
simplify the presentation, we neglect the index k for the semiconductors wherever 
possible. The scaled drift-diffusion equations for each semiconductor read 

ld^,,il/ = n~p-C, (10.14) 

-8,n + v„8.,/„ = R{n,p), (10.15) 

8,p + v,AJp = -R{n,p), (10.16) 

J„^d,,n~nd,,il/, (10.17) 

Jp = -c),p - pd,^ (10.18) 

with 1 :— jj ''ij^ii , v„ := -jj^ and v^ := -^. Finally we evaluate the boundary 
conditions (10.5) and (10.6) at the Ohmic contacts, e.g. 

n{t,0)=\(^^C{Qf + An} + C{Q)y n{t,l) =\(^^ C{\f + An] + €{1)^, 



P(f,0) = - ^^JC{Of+An] - C(0)j , Pit, 1) - - (^VC(l) + 4«f - C(l) 

for all te [0,T]. 

Note, that in ID simulations, there are no boundary conditions for /„ and Jp, 
since all boundaries are connected to the network, e.g. Fg = {0, 1} and Yj — 0. 



10.3 Simulation of tlie Full System 

Classical approaches for simulation of drift-diffusion equations (e.g. Gummel 
iterations [7], a nonlinear Gauss-Seidel approach) approximate J„ and Jp by 
piecewise constant functions and then solve equations (10.15) and (10.16) with 
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respect to n and p explicitly. This helps reducing the computational effort and 
increases the numerical stability. For the proposed model order reduction this 
method has the disadvantage of introducing additional non-linearities, since the 
discrete solution of n and p is build up in terms of exp tp, see [17]. 

Here we consider two complementary finite element approaches to the drift- 
diffusion equations, namely standard Galerkin and mixed finite element methods. 
We start with a standard Galerkin approach. For the sake of simplicity we use an 
equally distributed finite element mesh with A^ elements and mesh width h := l/N. 
The functions ij/, n and p are approximated by piecewise linear and globally con- 
tinuous functions, /„ and Jp are approximated by piecewise constant functions, e.g. 

N N N 

il/{t,x) := ^ !//,.(/)(/),. (x), n{t,x) := ^«; (f ),/.,. (x), p{t,x) := ^p;(f)</),.(x), 

/=0 (=0 /■=0 

N N 



Jnit,x) := ^7„j(f)(p,.(x), Jpit,x) := J^Jp.MViix) 



where the functions {</),} and {ip,} are the corresponding ansatz or trial functions. 
For ip, n and p only the interior coefficients, e.g. i//(r) — (i/^i, . . ., i//^_i) , are 
variable, the coefficients corresponding to the boundary elements are given by the 
Dirichlet boundary conditions. Note that the time is not discretized at this point 
which refers to the so-called method of lines. The finite element method leads to 
the following DAE for the unknown vector-valued functions of time i//, n, p, J,,, Jp 
for each semiconductor: 

= ;.5iA(f) + Mn{t) - Mp{t) - C, + b^{e{t)), 
-Mh{t) = -v„D^J„{t)+hR{n{t),p{t)), 
Mp{t) - -VpD^Jp{t) - hR{n{t),p{t)), (10.19) 

= hJ„{t) + Dn{t) - diag{Bn{t) + b„) Dxi/{t) + b„, 
= hJp{t) - Dp{t) - diag{Bp{t) + bp) Dxl/{t) + bp, 

where 5,M e M*'^"'''''^"'^ and D,5 e M^'^*^"'' are assembled finite element 
matrices, diag(v) denotes the operator which takes a vector v and creates a 
diagonal matrix with v on the diagonal. The vectors b^{e{t)), b„, b„, bp and bp 
implement the boundary conditions imposed on xj/, n and p, which are given by 

«o(f) = \ [^JciQf + ^n^ + C(0)^ , «^(r) - i (^^C{\)- + An] + C(l)^ , 
Po{t)^\Uc{Qf + ^n]-C{Q)\ p:,{t)M^C{\f + An}-C{\) 
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Discretization of the coupling condition for the current completes the discretized 
system: 

Supposing that the fluxes J„ and Jp as well as the gradient of ij/ play a dominant 
role in (10.14)-(10.18) one might argue, that they should be resolved better than as 
piecewise constant functions. This directly leads to the Raviart-Thomas finite 
element approach, where the concentrations n and p and the potential ij/ are 
approximated by piecewise constant functions, but the fluxes Vi/', /„ and Jp are 
elements of the Raviart-Thomas space of order zero. In one space dimension, the 
Raviart-Thomas approach leads to piecewise linear, continuous approximations to 
these functions, so that the ansatz space is just span{(/)o, . . ., (t>N}- The ansatz space 
for ij/, n and p is given by spanjipi, . . ., tp^}. The idea is now not to discretize 
(10.14)-(10.18) separately, but instead to discretize variable-flux pairs together. 
For example the Eqs. 10.15, 10.17 

-8,n + v„8,.J„ ^R{n,p), 

J„ = dx^ — nSvi/', 

define the variable n and its flux J„. The first equation is tested with q), the second 
is tested with cj) and integrated by parts to obtain 



• / {8,n)(p+v„ / (p{8xJn) = / R{n,p)(p, 



J„<P = - n{6,(t)) ~ / n(8.,i/^)(/) + [nc/)],,. 
n n n 

This formulation avoids space-derivatives of «. The pairs of Eqs. 10.16, 10.18 
and 10.14, 10.20 are treated in a similar way. By using {cpj} and {(pi) as test and 
trial functions and by introducing a new variable g^j, and the equation 

8^ = 8.^, (10.20) 

we arrive at the following finite dimensional system of equations: 

= ADg,/,(f) + hn{t) - hp{t) + Ch 
-hh{t) = v„DJ„{t) + hR{n{t),p{t)) 
hp{t) = VpDJpit) - hR{n{t),p{t)) 

= -D^n{t) + MJ„{t) + B(g^(f), «(?)) - k 
0=~D'^p{t)+MJp{t)+-B{g^{t),p{t))-bp 
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Table 10.1 Diode model 
parameters 


Parameter 


Value 


L 


10-* [cm] 




Ut 


0.0259 [V] 




l^n 


1,350 [cm-/ Vsec] 




/^ 


480[cmVVsec] 




a 


10-5 [^^2] 




B 


1.03545 X 10-'2[F/cm] 




m 


1.4 X 10'0[l/cm3] 




T„ 


330 X lO-' [sec] 




Tp 


33 X 10-" [sec] 




C(x), x<L/2 


-9.94 X 10'5 [l/cm^] 




C(x), x>L/2 


4.06 X 10"* [l/cm3] 


02 


frequency = 1 .Oe+06 


02 


frequency = 1 .Oe+09 


02 


frequency = 5.0e+09 


0.15 


/-^ 


0.15 




0.15 




5" 

^ 01 


A 


< 

^ 01 


A 


< 

^ 01 




2' 0.05 

t 

o 


/\ 


2" 0.05 

- t 
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J \ 


2" 0.05 
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_A 




^ 


w 


-0.05 
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1^ 1 
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Fig. 10.2 Current yV through the basic network for input frequencies 1 MHz, 1 GHz and 5 GHz. 
The capacitive effect is clearly demonstrated 



with sparse matrices D G R'^'^'-'^^^K M e ; 



UN+l)x{N+\) 



, a bilinear map B : 



p(A'+l) 



T>N+l 



and vectors C/, G M^ and b,^{e{t)),b„,bp € R^+^ 



The discretized equations are implemented in MATLAB, and the DASPK 
software package [13] is used to integrate the high dimensional DAE. Initial values 
are stationary states obtained by setting all time derivatives to 0. In order to solve 
the Newton systems which arise from the BDF method efficiently, one may reorder 
the variables of the sparse system with respect to minimal bandwidth. Then, one 
can use the internal DASPK routines for the solution of the linear systems. 
Alternatively one can implement the preconditioning subroutine of DASPK using 
a direct sparse solver. Note that for both strategies we only need to calculate the 
reordering matrices once, since the sparsity structure remains constant. 

A basic test circuit with one diode is depicted in Fig. 10.1, where the model 
parameters are presented in Table 10.1. The input Vs(?) is chosen to be sinusoidal 
with amplitude 5 V. The numerical results in Fig. 10.2 show the capacitive effect 
of the diode for high input frequencies. Similar results are obtained in [16] using 
the simulator MFCS. 
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10.4 Model Reduction 

Now we want to reduce the computational effort of repeated dynamical simula- 
tions by applying proper orthogonal decomposition (POD) to the drift-diffusion 
equations. The POD reduction procedure is formulated in the Appendix, and from 
here onwards is used with X := L?{Q.). As an example we discuss the reduction 
based on the standard finite element approximation (10.19), Raviart-Thomas ele- 
ments are treated analogously. 

The test functions for a standard Dirichlet problem are expected to vanish at the 
boundary. Hence, in the standard finite element case, before performing the POD 
of the state space, we relate the solution to a reference state (not necessarily a 
solution), e.g. a stationary solution or a mean value, that fulfills the boundary 
conditions. The functions 

i// = i// - i/',., h = n-nr, p^p-Pr 

then satisfy homogeneous boundary conditions. The reference i//^ is an exception 
in so far, that in general it cannot be a stationary state, since i// underlies varying 
boundary conditions. Here, we use the stationary state, scaled in such a way that 
the boundary conditions are satisfied, e.g. 

^'■^ ' ^ " ^(0, 1) ^'-^ ' ^ ~ Aje(O) + lA,/^^ ' ^- 



We further set 



1 — J - f ~J — J - J' 



In the case of the Raviart-Thomas approach the relation to a reference state is 
not necessary, since the boundary values are included more naturally through the 
variational formulation. 

The time-snapshot POD procedure described in the Appendix delivers Galerkin 
ansatz spaces for xj/, h, p, J„ and Jp. This leads to the ansatz 

/°'^W = ^,(r) + t/^,//,^(0, 

«PO'^(r) = «,(r) + U„H„{t), /^^(f) = p,{t) + UpHp{t), 

7^(0 = J'„it) + UjHjM, ^r°(^) - ^;W + Ujr"j,.if)- 
The injection matrices 

Uj eR'^^*^", Uj eR^''-^'", 
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contain the (time independent) POD-functions as columns as in Eq. 10.21, the 
vectors //(.) the corresponding time-variant coefficients. The number A:(.) is the 
respective number of POD basis functions included. Assembling the POD system 
then leads to a DAE system similar to (10.19), 

= XSH^ + Uj,{M{U„H„ - UpHp) - Q) + ci + ~c,{e), 
-M„„H„ = -VnblUj,^ + hUj,R{nr + t/„//,„/7, + UpH,,) + cj, 
MppHp = -VpDj,Hj^ - hUjR{n,. + U„Hn,Pr + UpHp) + a, 
= /!//y„ + b„H„ + R{H,„H,^) + L„H,i, + C4, 
= hHj^ - bpHp + B{Hp, H,^) + LpH^ + c,. 

Here, 

S = UpU^, M„„ - UjMUn, Mpp = UjMUp, 
b„ - UjDU„, Dp = UjDUp, 

where the matrices M, S, and D and the vector C/, are the same as in Eq. 10.19. The 
constant vectors c,- and matrices L„, Lp arise from the reference states, and can be 
computed offline together with the other matrices and the bi-linear map B; C2 and 
C3 vanish in case of stationary reference states n,- and /?,.. The vector ci(e(r)) 
includes the boundary condition on ij/. Note, that all matrix-matrix multiplications, 
such as UJMU„ can be calculated in the offline-phase. The nonlinear recombi- 
nation function R must be evaluated online and cannot be reduced. Also the 
bi-linear B cannot be reduced completely. Besides the linear terms one can also 
expect a performance gain in the solution of the linear systems in BDF method, 
which are now relatively small. 



10.5 Numerical Investigation 

Figure 10.3 shows the development of the error between the reduced and the unre- 
duced numerical solutions, plotted over the neglected information A (see (10.22)), 
which is measured by the relative error between the non-reduced states ij/, n, p, J„, Jp 
and their projections onto the respective reduced state space. The number of POD 
basis functions for each variable is chosen so that the indicated approximation quality 
is reached, i.e. A := A,^ ~ A„ ~ A^ ~ Ay„ ~ Ay_. Since we compute all POD basis 
functions anyway, this procedure does not involve any additional costs. 

In Fig. 10.4 the simulation times are plotted versus the neglected information A. 
As one also can see, the simulation based on standard finite elements takes twice as 
long as that based on RT elements. However, this difference is not observed for the 
simulation of the corresponding reduced models. 
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Fig. 10.3 L2 error of ^V 
between reduced and 
unreduced problem, both for 
standard and Raviart-Thomas 
FEM 
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The whole POD approach in the present situation malces sense only, if the 
singular values would decay rapidly. Else one would have to use too many POD 
basis functions and would end up with a rather large and dense POD Galerkin 
system. To illustrate, that the singular values are indeed decaying exponentially. 
Fig. 10.5 shows the total number of singular vectors k — k^ + k„ + kj, + kj^^ + kj^ 
required to undercut a given state space cut-off error A. While the number of 
singular vectors included increases only linearly, the cut-off error tends to zero 
exponentially. 
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Fig. 10.5 The number of 

required singular values 
grows only logarithmically 
with the requested accuracy 



400 




Although POD model order reduction often works well, it is clearly a drawback 
of the method that the reduced system depends on the inputs and on the parameters 
of the system under consideration. A possible remedy consists in performing 
simulations over a certain input and/or parameter range and then to collect all 
simulations in a global snapshot matrix Y :— [Y^ ,Y^, . . .]. Here, each Y' represents 
the snapshots taken for a certain input resp. parameter. In this context now the 
question pops up which inputs/parameters to choose in order to obtain a reduced 
model, which is valid over the whole input/parameter range. Let us elaborate on 
this complex of questions with the following example. 

For the basic circuit we choose the frequency of the input voltage v, as 
parameter. Let the parameter space be the interval [10^ Hz, lO'^Hz]. We start the 
investigation with a reduced model where the snapshot matrix is created from the 
simulation of the full model at a frequency of 10'° Hz. The difference between 
simulations of the full model and the reduced model is the reduction error plotted 
in Fig. 10.6 (dashed line). A second reduced model is constructed by adding 
snapshots from the full simulation at a frequency of 10^ Hz, which is the fre- 
quency for which the error is maximal. We do not alter the number of considered 
singular values. One can see that the error is significantly reduced in the second 
model (dotted line). A third model is constructed analogously (solid line). 

Of course this adaptive reduction method only works in theory, since it is based 
on full model integrations over the whole parameter space. For practical purposes 
we will need to develop a-posterior error estimators in the parameter space. It may 
be possible to apply the methods of [12] to the problem under consideration. In the 
present situation also a parameter POD along the lines of [10] could be applied. 
The parameter snapshots are chosen as simulations of the quasi-stationary drift- 
diffusion model related to the respective parameter value (here: frequency of 
sinusoidal input voltage source). 



10 POD Model Order Reduction of Drift-Diffusion Equations in Electrical Networks 189 



Fig. 10.6 Reduction error 
plotted over the frequency 
parameter space. The reduced 
model was created based on 
1 to 3 full simulations at the 
reference frequencies. The 
reference frequencies are 
marked by squares. The 
number of singular values 
used for the reduction is the 
same for all three models 
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Fig. 10.7 Rectifier network and simulation results. The input v, is sinusoidal with offset + 1 .5 [V] 



Finally we note that the presented reduction method does not create reduced 
models for semiconductors as such, but accounts for the position of the semi- 
conductors in a given network. The modes or POD basis functions of two identical 
semiconductors may be different due to their different operating states. To dem- 
onstrate this fact, we consider the rectifier network in Fig. 10.7 and measure the 
distance between the spaces t/' and U^ which are spanned, e.g., by the POD- 
functions Ul of the diode ^i and U^ of the diode 52 respectively, by 



d{U\U^ 



max mm m 

«ef/' veu- 

Il"ll2 = l Il''ll2 = l 



Exploiting the orthonormality of the bases U\ and Ul and using a Lagrange 
framework, we find 
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Table 10.2 Distances 
between reduced models in 
the rectifier network 



10-^ 
10-5 
10"*^ 
10-' 



d{U'M^) 



0.61288 
0.50766 
0.45492 
0.54834 



d(U\U^) 



5.373 X 10-** 
4.712 X 10-** 
2.767 X 10-' 
1.211 X 10-* 



where X is the smallest eigenvalue of the positive definite matrix SS^ with 



Sij = 



, u 



.A,;'"iA.j72 



The distances for the rectifier network are given in Table 10.2. 
While the reduced model for the diodes S\ and ^3 are almost equal, the models for 
the diodes Si and S2 are significantly different. Similar results are obtained for the 
reduction of n, p, etc. 
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Appendix: Proper Orthogonal Decomposition 



Let y : Q. X [0, T] — > K be a function, which in the situation of Sect. 10.4 repre- 
sents the time continuous finite element approximation of 1/^, n, p, g^, Jn and Jp, 
respectively. For t e [0, T] let >'(•,?) € Y^ '■— span{(/)i, . . ., 0^}, where (/),• {i = 
I, . . .,N) denote linearly independent elements of a Hilbert space X. 

The snapshot variant of (POD) introduced in [18] works as follows; let 
Y = [y^ , ■ . ., v'] e R'^^' contain as columns the coefficient vectors of / time-snap- 
shots y{-, ti) taken at time instances r, e [0, T], i.e. 

Furthermore, let A^ := (((Am, </>„)x),„n=i n ^^^^ its Cholesky factorization 
M = LL^ . Let ({/, S, V) denote the singular value decomposition of Y := L^Y, i.e. 



Y = UI.V with U € 



zg 



and V S K and S a diagonal matrix 



containing the singular values < (Ji in decreasing order for I <i<l, where we 
assume KN. We set 



U:^L-^U^,i,,^ = [u\...,u']. 



;io.2i) 



Then, the A:-dimensional POD basis of spanj^-^j y'i'Pji ' = 1, . • ., '} ^ JV is given 
by 
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Note that one chooses k<m, where it„ > denotes the smallest non-vanishing 
singular value. The expression 1 — A(^) is a measure of the information content 
of the subspace spanned by the first k POD basis functions, where 




AW-^/1-^f^. (10.22) 

An extended introduction to POD can be found in [14]. 
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Chapter 11 

Model Reduction of Periodic Descriptor 

Systems Using Balanced Truncation 

Peter Benner, Mohammad-Sahadet Hossain and Tatjana Stykel 



Abstract Linear periodic descriptor systems represent a broad class of time 
evolutionary processes in micro-electronics and circuit simulation. In this paper, 
we consider discrete-time linear periodic descriptor systems and study the con- 
cepts of periodic reachability and observability Gramians. We also discuss a lifted 
representation of periodic descriptor systems and propose a balanced truncation 
model reduction method for such systems. The behaviour of the suggested model 
reduction technique is illustrated using a numerical example. 

Keywords Periodic descriptor systems • Lifted state space representation • 
Periodic projected Lyapunov equations • Balanced realization • Model reduction 



11.1 Introduction 

Linear discrete-time periodic descriptor systems have received a lot of attention 
over the last twenty years. They are suitable models for several natural as well as 
man-made phenomena, and have applications in modeling of periodic time- 
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varying filters and networks [16, 21], multirate sampled-data systems [16, 18], 
circuit simulation [3, 8, 10, 18], micro-electronics [19, 20], aerospace realm 
[34, 35], control of industrial processes and communication systems [1, 20]. 

A linear discrete-time periodic descriptor system with time-varying dimensions 
has the form 

EkXk+i = AkXk + BkUk, yk^CkXk, k e Z, (H-l) 

where Et G R'''+' ^"'-' , A^ e Rft+i ><"'■, Bk G ]R'''-+i^""-, Q e K^^"' are periodic with 

a period K>1 and J2k=o f-k = J2k=o "* — "■ The matrices £^- are allowed to be 
singular for all k. For system (11.1), a reduced-order model of dimension r would 
be a system of the form 

EkXk+i ^ Akh + BkUk, yk^CkXk, k e Z, (11-2) 

where Ek G »''+■ ^''+' , Ak G M''-+''"'*, Bk G ]R>''-+'^""-, Q G W'"''' are TiT-periodic 
matrices, J2k=o yk — J2k=o ^k ~ r and r <^ n. Apart from having a much smaller 
state-space dimension, it is also important that the reduced-order model preserves 
physical properties of the original system such as regularity, stability and passivity, 
and that the approximation error is small. 

The dynamics of the discrete-time periodic descriptor system (11.1) are often 
addressed by the regularity and the eigenstructure of the periodic matrix pairs 
{Ek,Ak}klQ ■ If all Ek are nonsingular, the eigenvalues (also called characteristic 
multipliers) of system (11.1) are given by the eigenvalues of the matrix product 

E^I^Ak-iE^\2Ak-2---E^,'Ao (11.3) 

associated with the periodic matrix pairs {£t, A^}^Jq . This product only yields a 
well-defined matrix if all Ej^ are nonsingular. Even if they are, the formulation of 
that matrix should be avoided for reasons of numerical stability. Note that even for 
some Ek being singular, we use (1 1.3) in a formal way to denote a generalization of 
matrix pencils to this periodic case (see [2] for details of this formal matrix product 
calculus). We compute the eigenvalues of (11.3) via the generalized periodic 
Schur decomposition [11, 32]. 

There exist unitary matrices Pk G C'''+' ""'''+' and Qk G C"""", with Qk+K = Qk 
such that the transformed matrices 

Nk = PlEkQk+i , Mk = PlAkQk, k^O,...,K~l, 

are all upper triangular, where for the ease of notation we allow complex arith- 
metic (in practice, however, computations can be performed in real arithmetic 
leading to quasi-triangular structure of one of the Mk). 
Then the formal matrix product 

Nk'-,Mk-iN^L2Mk-2 ■ --N^'Mo 
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has the same eigenvalues as (1 1.3). The blocks on the diagonals of the transformed 
matrices M/^ and A^^. are used to define the eigenvalues of the periodic matrix pairs 
{Ei^,Ait}i^^Q . A finite eigenvalue is given by 



1/ 



K-l {k) 

-- w 
k=o nil 



n- 



(k) 



„(*) 



provided n^ ^ for k = 0,...,K-l. Here m)'' e A(Mi) and n),' e A(A^i), 
where A denotes the eigenspectrum of the corresponding matrix. An eigenvalue is 

called infinite if Wi^Zq «// = 0, but Wi^Zq niu ^ 0. 

In this paper, we briefly review some basic concepts of discrete-time periodic 
descriptor systems (Sect. 11.2). In Sect. 11.3, we study the periodic reachability 
and observability Gramians from [6] using a lifted representation. A balanced 
truncation model reduction method for periodic descriptor systems is presented in 
Sect. 11.4. Section 11.5 contains a numerical example that illustrates the properties 
of the suggested model reduction technique. 



11.2 Periodic Descriptor Systems 



Lifted representations of discrete-time periodic systems play an important role in 
extending many theoretical results and numerical algorithms for time-invariant 
systems to the periodic setting [4, 7, 33]. We consider here the cyclic lifted 
representation which was introduced first for standard periodic systems in [17]. 
The cyclic lifted representation of the periodic descriptor system ( 11 . 1 ) is given by 



■^+1 = ja/;r^ + ^^k. 9k = "^'Xk, 



(11.4) 



where 



^ 



= diag(£o,£'i,---,£'/sr-i), 
••• Ao' 
Ai 







Ak-x 



'^ = 



diag(Bo,5i,...,Bjf_i) 
••• Co 
Ci 







Ck-x 



(11.5) 



The descriptor vector, system input and output of (11.4) are related to those of 
(11.1) via 



respectively. 
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A set of periodic matrix pairs {Ek,Ai;}^^Q is called regular if the pencil 
zS" — s/ is regular, i.e., det(z^ — ,s/) ^ 0. In this case, we can define a transfer 
function of the lifted system (11.4) as M'\z) = 'i^{zS' - .s^)'^^. 

The regular set of periodic matrix pairs {E^, A^tj^Jo can be transformed into a 
periodic Kronecker canonical form [6, 32]. For ^ = 0, 1,...,A'— 1, there exist 



nonsingular matrices Wit G 



''''+' and Zk e 



such that 



WkEkZk+i = 











WkAA - 



/„ 



(11.6) 



where Z^^ — Zq, ^k+K-\^k+K-2 • • '^^ = •/* is an «^ x «( matrix corresponding to 



the finite eigenvalues, E^E^^^ ■ ■ ■ E'' 
corresponding to an eigenvalue at infinity, n^ 



k^k+i---^k+K-i ^Nu is an nf 



xni 



nilpotent matrix 



d. 



and Hit 



+1 



^l+i 



The index v of the periodic descriptor system (11.1) is defined as v = 
max(vo, Vi, . . ., Vk-\), where v^ is the nilpotency index of A^^^.. Note that the finite 
eigenvalues of {Zi^, Aj^}j(.Jq coincide with the finite eigenvalues of the lifted pencil 
zS - .a/. 

For k = Q,\, . . .,K ~ 1, the matrices 



Pr{k) 



I f 


0" 


"i+1 











Pi{k) - w, 



7 , 


0" 


«A+1 











w,, 



are the spectral projectors onto the Ath right and left deflating subspaces of the 
periodic matrix pairs {£'i,,Aj:}^Jg corresponding to the finite eigenvalues, and 
Qr{k) = / ~ Pr{k) and Qi{k) = I — Pi{k) are the complementary projectors. Let 



for every A; = 0,1,...,A' 



1, the vector Z^ 'x^. 



WkBk 






CkZk 



- [ (4) ' ^^k 



■b\T]T 



and the matrices 



be partitioned in blocks conformally to the periodic matrix pairs {£yt,Ai}^jQ in 
(11.6). Under this transformation, system (11.1) can be decoupled into forward and 
backward periodic subsystems 






^k-^k+l ^ -^k 



Bf,. 



k'^k, 



B^Uk, 



A = c{4. 



yl 



r*x* 



(11.7) 
(11.8) 



respectively, withji^t — yk+y^k^k = Q,\, . . .,K ~ 1. The state transition matrix for 



the forward subsystem (11.7) is given by <^f{i,j) 






■ A I for / > j and 



<t>f{i,i) —If. For the backward subsystem (11.8), the state transition matrix is 
defined as ^^{ij) — E'^E'^^^- ■ ■ E''_^ for i<j and (bi,{i,i) — I„^. Using these 
matrices we can now define the forward and backward fundamental matrices of the 
periodic descriptor system (11.1) as 
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$/(/,;■+!) 




~MiJ) 






These fundamental matrices play an important role in the definition of the 
reachability and observability Gramians of the periodic descriptor system (11.1) 
that we will consider in the next section. 



11.3 Periodic Gramians and Matrix Equations 

The dynamics of the periodic descriptor system (11.1) are often addressed by the 
eigenstructure of the periodic matrix pairs {Zs^, A,t}jjg . 

Definition 1 The periodic matrix pairs {Zs^, A^j^^Jg are said to be periodic stable 
(pd-stable) if all the finite eigenvalues of {Ek,Aif}i^^Q lie inside the unit circle. 

In balanced truncation model reduction, Gramians play a fundamental role 
[15, 25, 28, 31]. For periodic descriptor system (11.1), the reachability and 
observability Gramians have been first introduced in [6]. 

Definition 2 Suppose that the periodic matrix pairs {is^., A^}^,Jq are pd-stable. For 
^ = 0, 1, . . ., A" — 1, we define the causal and noncausal reachability Gramians 
Gf and GT of system (11.1) as 

The complete reachability Gramian G[ is the sum of the causal and noncausal 
Gramians, i.e., G^ = G^' + G'l"' for fe = 0, 1, . . ., ^ - 1. 

Definition 3 For the pd-stable matrix pairs {EkTAk]i^Zo and A; = 0, 1, . . ., ^ — 1, 
the causal and noncausal observability Gramians Gf and G^™ of system (11.1) 
are defined as 



j=k j=k-vK 



k-l 

.k-\- 



The complete observability Gramian Gl is the sum of the causal and noncausal 
Gramians, i.e., Gl = Gl" + Gl'" for fc = 0, 1, . . ., ^ - 1. 

Note that the causal and noncausal Gramians correspond to the forward and 
backward subsystems (11.7) and (11.8), respectively. 
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11.3.1 Periodic Projected Lyapunov Equations 

It has been shown in [26] that the Gramians of discrete-time descriptor systems 
satisfy projected generalized discrete-time Lyapunov equations with special right- 
hand sides. A similar result also holds for periodic descriptor systems. 



Theorem 1 [6] Consider a periodic discrete-time descriptor system (11.1), where 

lk=0 



the periodic matrix pairs {Ek,Ak\i^^Q are pd-stable. 



1. For fe = 0, 1,...,^"— 1, the causal reachability and observability Gramians 

{^r}yt=o ^"^ {^™}/t=o '""^ ^^^ unique symmetric, positive semidefinite solu- 
tions of the projected generalized discrete-time periodic Lyapunov equations 
(PGDPLEs) 



(11.9) 



and 



A,Gl'-Al - £,G- ,£[ = -~P,{k)B,BlP,{kf, 
GY = P,{k)GYP,{k)\ 

AlGl%A, - ClCr^H = - Pr{kfClCkPr{k), 

Gl°^P,{k-\fGl°Pi{k-\), 

respectively, whereG^ = G'J, G™ = G{°,E_i = EK-x,andPi{-\) = P,{K - 1). 
2. For k — Q,\, . . .^K — \, the noncausal reachability and observability Grami- 
ans {G'l'^''}i^^Q and {G1™}i^^q are the unique symmetric, positive semidefinite 
solutions of the PGDPLEs 

AkGl'^Al - EuGl';,El - Q,{k)B,BlQi{k)\ 
Gr = Qr{k)GrQr{k)\ 

and 

AlGTM^^El.GrE,^, = QrikfclC,Q,{k) 

GT^Qiik-lfGrQiik-l), 
respectively, where G^ ^ G^" , G^' = Gg™ and Qi{-\) = Qi{K - 1). 

Numerical solution of the PGDPLEs has been considered in [6]. The method 
proposed there extends the periodic Schur method [5, 29, 30] and the generalized 
Schur-Hammarling method [23] developed for periodic standard and projected 
generalized Lyapunov equations, respectively. This method is based on an initial 
reduction of the periodic matrix pairs {Ek,Ak]i^ZQ to the generalized periodic 
Schur form [11, 32] and solving the resulting generalized periodic Sylvester and 
Lyapunov equations of (quasi)-triangular structure using the recursive blocked 
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algorithms [9]. Due to the computational complexity, the periodic generalized 
Schur-Hammarling method is restricted to problems of small and medium size. In 
[12, 27, 29], iterative methods based on Smith iterations [22] have been developed 
for periodic standard Lyapunov equations and also for projected generalized 
Lyapunov equations. These methods can also be extended to periodic projected 
Lyapunov equations by using the lifted representation of these equations. There- 
fore, we will discuss these lifted representations in the following subsection. 



11.3.2 Lifted Representation of Periodic Lyapunov Equations 

It is known that the Gramians of standard periodic systems satisfy the lifted form 
of the periodic Lyapunov equations and the solutions of these equations are 
diagonal matrices [12, 29]. 

The following theorem describes the block structures of the solutions of peri- 
odic Lyapunov equations in lifted form and their relations to the corresponding 
solutions of PGDPLEs in Theorem 1 . 

Theorem 2 Consider the periodic discrete-time descriptor system (11.1) and its 
lifted representation (11.4), where the periodic matrix pairs {E^jA^I^Jq are pd- 
stable. The causal and noncausal reachability Gramians '^" and ^'^" satisfy the 
lifted projected Lyapunov equations 



f'S".s4'^ - s'S"s'^ = -gPiM^'^0'], "S" = &>r's"0'l, 



(11.10) 



respectively, where S, s^ and 3S are as in (11.5) and 

^" = diag(Gr, . . ., Gl_,,G<^^\ T" = diag(Gr-'-, . . ., GT-i.G"^, 

i^, = diag(P,(0),P,(l),---,^/(^-l)), ^1=1-^1. .^^^j. 

^, = diag(P,(l),...,P,(^-l),P,(0)), J, = /-.^,. 

Proof We will only sketch the proof due to space limitation of the paper. Using 
the block structure of matrix coefficients, a straightforward computation shows that 
the projected Lyapunov Equation (11.10) is equivalent to the periodic projected 
Lyapunov Equation (11.9). Since the periodic matrix pairs {Ek,Ait}i^Z{) are pd- 
stable, the pencil zS — ,«/ is regular and all its eigenvalues lie inside the unit circle. 
Then (11.10) has a unique solution [24]. The proof for 'S"" can be treated 
similarly. D 

For the observability Gramians, the situation becomes a bit more complex. The 
reason is that we do not want to destroy the block diagonal structure of the lifted 
solutions and we would like to use the lifted solution to find a balanced realization 
of the original system. 
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Theorem 3 Consider the periodic discrete-time descriptor system (11.1) and its 
lifted representation (11. 4). The causal and noncausal observability Gramians ^™ 
and ^"'^° satisfy the lifted projected Lyapunov equations 

respectively, where S and ,«/ are as in (11.5), the projectors SPi, SP,-, J2; and Sir 
are as in (11.11), '^ = diag(Ci, . . ., Cjf_i, Co) and 

^^^ = diag(Gr , . . ., G™_, , GS"), ^"^^ = diag(Gr, • • -, GT-, , GD- 
Proof The proof is analogous to the previous proof of Theorem 2. D 



11.3.3 Hankel Singular Values 

Let the periodic matrix pairs {Ek,Ak}i^I(-^ be pd-stable. Then the causal and non- 
causal matrices Ml = G'{El_^Gl°Ek-x and Mf = G'l"'AlGf°^Ak, yt == 0, 1, . . ., 
K — \, have real and nonnegative eigenvalues. These eigenvalues are used to define 
the causal and noncausal Hankel singular values of system (11.1). 

Definition 4 Let the set of periodic matrix pairs {Ek:Ak}i^Z{i be pd-stable. For 
^ = 0,1,..., A"— 1, the square roots of the largest n{ eigenvalues of the matrix Ml, 
denoted by Gj^j, are called the causal Hankel singular values and the square roots 
of the largest m^ eigenvalues of M'l'^, denoted by Oj^j, are called the noncausal 
Hankel singular values of the periodic descriptor system (11.1). 

Since the causal and noncausal reachability and observability Gramians are 
symmetric and positive semidefinite, there exist the Cholesky factorizations 

Gl'^RkR,,, Gl° = L^Lk, Gl^^RkR^, G"^^" ^ Lj^Lu, (11-12) 

Simple calculations show that dkj — Lj{LkEk-iRk) and Okj — t^j{Lk+iAiiRk) , 
where I^j{.) denotes the singular values of the corresponding matrices. 



11.4 Balanced Truncation Model Reduction 

In this section, we present a generalization of a balanced truncation model 
reduction method to periodic descriptor systems. For a balanced system, the 
reachability and observability Gramians are both equal to a diagonal matrix 



1 1 Model Reduction of Periodic Descriptor Systems Using Balanced Trancation 



201 



[15, 25]. Balanced realizations for periodic descriptor system have been consid- 
ered in [6]. 

Definition 5 A realization {Ek,Aii,Bic,Ci() of a periodic descriptor system ( 11 . 1 ) is 
called balanced if 






0' 



' Gk = 


- ^k+i - 






" 
0*. 



where S^ = diag(ff;t,i , . . ., ff^ ,^/) and ©^ = diag(0^,i , . . ., 0^,,,-), k ^ 0,1, . . .,K ~ 1. 

Consider the Cholesky factorizations (11.12) of the reachability and observ- 
ability Gramians and let 



LkEk-iRk — Uk^kVi , Lk+iAkRk — Uk&kV,^ 



(11.13) 



be the singular value decompositions of the matrices LkEk-iRk and Lk+\AkRk for 
A: = 0, 1,...,^— 1. Here Uk,Vk,Uk, Vk are orthogonal, and E^ and 0^ are diag- 
onal. If a realization {Ek,Ak,Bk,Ck) with pd-stable matrix pairs {Ek,Ak}i^ZQ is 
minimal, i.e., "Lk and 0^ are nonsingular, then there exist nonsingular periodic 
matrices 



-1/2 



Sk - [h+\^k+l'^k+{', Ll^iUk&k 



Pl-l/^l 



Tk 



[RkVkI^^"\ 



RkVkev''^] 



such that the transformed realization {SjEkTii+\,SlAi,Tk,SlBk,CkTk) is balanced 
[6]. Note that as in the case of standard state space systems, the balancing 
transformation matrices for periodic discrete-time descriptor system (11.1) are not 
unique. 

Model reduction via balanced truncation is discussed very widely for standard 
discrete-time periodic systems [12, 31] and also for continuous-time descriptor 
systems [14, 25]. For a balanced system, truncation of states related to the small 
causal Hankel singular values does not change system properties essentially. 
Unfortunately, we can not do the same for the noncausal Hankel singular values. If 
we truncate the states that correspond to the small non-zero noncausal Hankel 
singular values, then the pencil for the reduced-order system may get finite 
eigenvalues outside the unit circle that will lead to additional errors in the system 
approximation. 

Assume that the periodic matrix pairs {Ek,Ak}kIo ^^^ pd-stable. Consider the 
Cholesky factorizations in (11.12). Let 



LkEk-iRii — [f/i-,1, Vk.-A 



^k.\ 



-Kl 



[Vi,i,VM]' 



U^xAkRk = Uk&kVl, 
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be singular value decompositions of LkEk-\Rk and Lk+\AkRk, where 
E*,i =diag((7i,i,...,(7^/), Zo = diag((Tj,/ ,...,(!,/), 

^ k ' i+l k 

with OkA > ■■■ > ffi^j > a^./ >■■■> (7;, ,/ > 0, and &k = diag(6'i,i , . . ., 0^,;.^ ) is 

nonsingular for A; = 0, 1,...,^"— 1. Then the reduced-order system can be com- 
puted as 

Ek — Sj^^^EkTk+i^r, All = Si^/ikTk,r, Bk — Sj^^Bk, Ck — CkTk.r, (11-14) 
where 

Sk, =[Ll^,Uk+i^^-kl{\,Ll+fik®-k"^] e M"-'--', 

•-;/^ RkVk&; 



Tk, =[RkVk,i^k \'\ RkVk®;"^] e M"-'-s 



with rk = 'k^ ''T- L^*^ -^(z) be the transfer function of the reduced-order lifted 
system formed from the reduced-order subsystems in (11.14). Then we have the 
following Hoc -norm error bound 

K-l 

\W-^\W^= sup ||.jr(e'»)-i^V™)ll2<2 5Itrace(S,,2), (11.15) 

foe[0,27t] ^=0 

where ||.||2 denotes the matrix spectral norm and Zt2 contains the truncated causal 
Hankel singular values. This error bound can be obtained similarly to the standard 
state space case [13, 31]. 



11.5 Example 

We consider a periodic discrete-time descriptor system with li^^—nk — 10, 
n — \{), m — 2, pk = 3, and period A' = 3 as presented in [6, Example 1]. The 
periodic matrix pairs {£/t,A,t}j(.Jo are pd-stable with «{ = 8 and Uk — 2 for 
^ = 0, 1,2. The norms of the computed solutions of the periodic Lyapunov 
equations and the corresponding residuals, e.g., 

p'l = WAkG'lAl - EkGl^X + Pi(k)BkBlPi{kY\\^, 

are shown in Tables 11.1 and 11.2. 

Table 11.1 Norms and relative residuals for the reachability Gramians 



k 


II /"t-'r II 
II '-'k l|2 


pY 


II r't^cr II 

II W lb 


pT 





5.8182 X 10^ 


6.1727 X 10"'^ 


1.3946 X 10' 


1.5444 X 10""' 


1 


8.2981 X lO'* 


8.2172 X 10"'^ 


1.3660 X 10' 


1.7508 X 10-'" 


2 


7.1107 X 10^ 


3.0961 X 10"'^ 


1.4308 X 10' 


3.3847 X 10-"* 
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k 


II /"t-'CJ II 

II ^k l|2 


pT 


II r'nco II 

II ^k lb 


nca 
Pk 





9.7353 X 10' 


l.l&li X 10"" 


1.6866 X 10" 


1.3372 X 10"'^ 


1 


1.1373 X 10^ 


7.7003 X 10"'^ 


1.7406 X 10" 


2.1113 X 10"'^ 


2 


9.6984 X 10" 


1.7859 X 10"'^ 


1.6866 X 10" 


1.1626 X IQ-'^ 
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Fig. 11.1 a Causal Hankel singular values of different subsystems, b finite eigenvalues of the 
original and the reduced-order lifted systems 



The original lifted system has order n — 30. Figure 11.1a shows the causal 
Hankel singular values of the different subsystems for A: = 0, 1, 2. We see that 
they decay fast, and, hence system (11.1) can be well approximated by a reduced- 
order model. We have 24 causal Hankel singular values for the original lifted 
system and the remaining 6 are noncausal Hankel singular values which are 
positive. We approximate system (11.1) to the tolerance 10~^ by truncating the 
states corresponding to the smallest 7 causal Hankel singular values. 

Figure 11.1b shows the finite eigenvalues of the original and reduced-order 
lifted systems. We observe that stability is preserved for the reduced-order system. 
In Fig. 11.2a, we present the norms of the frequency responses .J^(e'™) and 
^(e'™) of the original and reduced-order lifted systems for a frequency range 
[0, 2n]. We observe nice match of the system norms. 

In Fig. 11.2b, we display the absolute error || Jf (e'™) — .^'{e"")\\2 and the error 
bound (11.15). One can see that the absolute error is smaller than the error bound. 



11.6 Conclusion 



In this paper, we have considered the reachability and observability Gramians as 
well as Hankel singular values for periodic discrete-time descriptor systems. For 
such systems, a balanced truncation model reduction method has been presented. 
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a The frequency responses of the original and the reduced-order lifted systems; 
error and error bound 



The proposed method delivers a reduced-order model that preserves the regularity 
and stability properties of the original system. A computable global error bound 
for the approximate system is also available. 
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Chapter 12 

On Synthesis of Reduced Order Models 



Roxana lonutiu and Joost Rommes 



Abstract A framework for model reduction and synthesis is presented, which 
enables the re-use of reduced order models in circuit simulation. Two synthesis 
techniques are considered for obtaining the circuit representation (netlist) of the 
reduced model: (1) by means of realizing the reduced transfer function and (2) by 
unstamping the reduced system matrices. For both methods, advantages and 
limitations are discussed. Especially when model reduction exploits structure 
preservation, we show that using the model as a current-driven element is possible, 
and allows for synthesis without controlled sources. The presented framework 
serves as a basis for reduction of large parasitic R/RC/RCL networks. 
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12.1 Introduction 



R. lonutiu and J. Rommes 



The main motivation for this chapter comes from the need for a general framework 
for the (re)use of reduced order models in circuit simulation. Although many 
model order reduction methods have been developed and evolved since the 1990s 
(see for instance [1] for an overview), it is usually less clear how to use these 
methods efficiently in industrial practice, e.g., in a circuit simulator. One reason 
can be that the reduced order model does not satisfy certain physical properties, for 
instance, it may not be stable or passive while the original system is. Failing to 
preserve these properties is typically inherent to the reduced order method used (or 
its implementation). Passivity (and stability implicitly) can nowadays be preserved 
via several methods [2, 10, 16, 19, 21, 26, 27], but none address the practical 
aspect of (re)using the reduced order models with circuit simulation software (e.g., 
SPICE [9]). This brings forward another reason of concern within the circuit 
simulation industry. The linear circuit to be reduced is represented by a netlist, 
which is a description of the circuit element values (R, L, C) and their connections 
to the circuit nodes (see also Fig. 12.1). However, reduced order models (as a 
result of model reduction applied on the dynamical system describing the original 
circuit) are usually represented in terms of system matrices or of the input-output 
transfer function. Typically, circuit simulators are not prepared for inputs in a 
mathematical representation, and would require additional software architecture to 
handle them. In contrast, a reduced model in netlist representation could be easily 
coupled to bigger systems and directly simulated. 

Synthesis is the realization step needed to map the reduced order model from 
the mathematical representation (in terms of system matrices or transfer function) 
into a netlist consisting of electrical circuit components [13]. In [7] it was shown 
that passive systems (with positive real transfer functions) can be synthesized with 
positive R, L, C elements and transformers (see also [23]). Later developments [6] 
propose a method to circumvent the introduction of transformers, however the 
resulting realization is non-minimal (i.e., the number of electrical components 
generated during synthesis is too large). Allowing for possibly negative 
R, L, C values, other methods have been proposed via e.g. direct stamping [18, 19] 
or full realization [14, 20]. These mostly model the input/output connections of the 
reduced model with controlled sources. 



Fig. 12.1 Circuit with 
terminal a, internal node 1, 
port P, and port Q(a,0) 
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In this paper we consider two synthesis methods that do not involve controlled 
sources: (1) Foster synthesis [13], where the realization is done via the system's 
transfer function and (2) RLCSYN synthesis by unstamping [29], which exploits 
input-output structure preservation in the reduced system matrices (provided that 
the original system matrices are written in modified nodal analysis (MNA) rep- 
resentation). The focus of this paper is on structure preservation and RLCSYN, 
especially because synthesis by unstamping is simple to implement for both SISO 
and MIMO systems. Strengthening the result of [29], we give a simple procedure 
to reduce either current- or voltage-driven circuits directly in impedance form, 
enabling synthesis without controlled sources. The reduced order model is avail- 
able as a netlist, making it suitable for simulation and reuse in other designs. 
Similar software [8] is commercially available. 

The material in this chapter is organized as follows. The remainder of this 
section introduces terminology for the different nodes pertaining to a circuit 
topology (Sect. 12.1.1). A brief mathematical formulation of model order reduc- 
tion is given in Sect. 12.1.2. The Foster synthesis is presented in Sect. 12.2. In 
Sect. 12.3 we focus on reduction and synthesis with structure (and input/output) 
preservation. Section 12.3.1 describes the procedure to convert admittance models 
to impedance form, so that synthesized models are easily (re)used in simulation. 
Based on [29], Sect. 12.3.2 is an outline of SPRIM/IOPOR reduction and 
RLCSYN synthesis. Numerical considerations on SPRIM/IOPOR are covered in 
Sect. 12.3.3. Experiments follow in Sect. 12.4, and Sect. 12.5 concludes. 



12.1.1 Internal Nodes, Terminals, and Ports 

The terms internal nodes, terminals (or external nodes), and ports often occur in 
electronic engineering related papers. An internal node is a node in a circuit that is 
not visible on the outside of a circuit, i.e., no currents can be injected in an internal 
node (cf. node 1 in Fig. 12.1). A terminal (external node) is a node that is visible 
on the outside, i.e., a node in which currents can be injected (cf. node a in 
Fig. 12.1). A port consists of two terminals that can be connected, for instance, by 
a source or another (sub)circuit (cf. port P in Fig. 12.1). Sometimes terminals are 
referred to as ports and vice versa: from the context it should then be clear which 
terminal(s) complete the ports; usually it is implicitly assumed that the ground 
node completes the ports. In Fig. 12.1, for instance, terminal a can be seen as a 
port (Q) by including the ground node. 



12.1.2 Problem Formulation 

In this paper the dynamical systems E(A,E,B,C,D) are of the form 

Ex(r) = Ax(r) +Bu(f), y(?) = Cx(r) +Du(f), where A,E e W"", E may be 
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singular but the pencil (A,E) is regular, B G M'""", C £ M''""', x(r) G K", and 
u(r) e M'", y(f) G M'', D G W"""'. If m, p > 1, tiie system is called multiple-input 
multiple-output (MIMO), otiierwise it is called single-input single-output (SISO). 
The frequency domain transfer function is defined as fl{s) = C(5E — A)^ B + D. 
For systems in MNA form arising in circuit simulation see Sect. 12.3 

The model order reduction problem is to find, given an n-th order (descriptor) 
dynamical system, a A:-th order system, k<n: Ex(f) = Ax(f) + Bu(f), y{t) = 
Cx(?) + Du(r), such that the output approximation error ||y(f) — y(f)ll (in appro- 
priate norm) is small. The corresponding dimensions are: E, A G M*^*^, B G S}^"", 

C G R'""*, x(f) G R*, u(/) G R"', fit) G R'', and D G R'""". The number of inputs 
and outputs is the same as for the original system, and the corresponding transfer 

function becomes: H(s) = C(iE — A)^ B + D. For an overview of model order 
reduction methods, see [1, 5, 25]. Projection based model order reduction methods 
construct a reduced order model via the Petrov-Galerkin projection: 



E(E,A,B,C,D) = (W*EV,W*AV,W*B,CV,D), (12.1) 

where V, W G R"^™ are matrices whose k <^ n columns form bases for relevant 
subspaces of the state-space. There are several projection methods, that differ in 
the way the matrices V and W are chosen. These also determine which properties 
are preserved after reduction. Some stability preserving methods are: modal 
approximation [24], poor man's TBR [22]. Among moment matching [11] meth- 
ods, the following preserve passivity PRIMA [19], SPRIM [10], spectral zero 
interpolation [2, 16, 26]. From the balancing methods, balanced truncation [4] 
preserves stability, and positive real balanced truncation [21, 27] preserves 
passivity. 



12.2 Foster Synthesis of Rational Transfer Functions 

This section describes the Foster synthesis method, which was developed in the 
1930s by Foster and Cauer [13] and involves realization based on the system's 
transfer function. The Foster approach can be used to realize any reduced order 
model that is computed by standard projection based model order reduction 
techniques. Realizations will be described in terms of SISO impedances 
(Z-parameters). For equivalent realizations in terms of admittances (K-parameters), 
see for instance [13, 28]. Given the reduced, proper system (12.1) and assuming 
that all its finite poles are simple [i.e., the matrix pencil (A, E) is non-defective], 
consider the partial fraction expansion [17] of its transfer function: 
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Fig. 12.2 Realization of a general impedance transfer function as a series RLC circuit 



Table 12.1 Circuit element values for Fig. 12.2 for the Foster impedance realization of (12.3) 
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(12.2) 



the poles are p, and, if non-zero, D gives 



additional contribution from poles at 00. An eigentriplet (/j,, x,-,y,) is composed of 
an eigenvalue pj of (A,E) and the corresponding right and left eigenvectors 
x,-,y,- £ C*. The proper expansion (12.2) belongs to the more general class of 
transfer functions (not necessarily proper) which can be expressed using basic 
summands of the form: 
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(12.3) 



For completeness, in (12.3) we can assume that any kind of poles may appear, i.e., 
either purely real, purely imaginary, in complex conjugate pairs, at 00 or at (see 
also Table 12.1). The Foster realization converts each term in (12.3) into the 
corresponding circuit block with R, L, C components, and connects these blocks in 
series in the final netlist. This is shown in Fig. 12.2. Note that any reordering of 
the circuit blocks in the realization of (12.3) in Fig. 12.2 still is a realization of 
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(12.3). The values for the circuit components in Fig. 12.2 are determined 
according to Table 12.1. 

The realization in netlist form can be implemented in any language such as 
SPICE [9], so that it can be reused and combined with other circuits as well. The 
advantages of Foster synthesis are: (1) its straightforward implementation for 
single-input-single-output (SISO) transfer functions, via either the impedance or 
the admittance transfer function, (2) after reducing purely RC or RL circuits via 
modal approximation [24], the reduced netlists obtained from Foster synthesis are 
guaranteed to have positive RC or RL values respectively (see [15] for a proof). 
Note however that Foster synthesis does not guarantee positive circuit elements in 
general (e.g., when used to synthesize reduced models originating from RLC 
circuits, or reduced models ofRC and RL circuits that were obtained with methods 
different than modal approximation). The main disadvantage is that for multi- 
input-multi-output transfer functions, finding the Foster realization (see for 
instance [28]) is cumbersome and may also give dense reduced netlists (i.e., all 
nodes are interconnected). This is because the Foster synthesis of a A:-dimensional 
reduced system with p terminals, will generally yield 0{p k) circuit elements. 



12.3 Structure Preservation and Synthesis by Unstamping 

This section describes the second synthesis approach, which is based on un- 
stamping the reduced matrix data into an RLC netlist and is denoted by RLCSYN 
[29]. It is suitable for obtaining netlist representations for models reduced via 
methods that preserve the MNA structure and the input-output connectivity at the 
circuit terminals. Such a method is the input-output structure preserving method 
SPRIM/IOPOR [29]. The strength of the result in [29] is that the input/output 
connectivity is synthesized after reduction without controlled sources, provided 
that the system is in impedance form (i.e., inputs are currents injected into the 
circuit terminals, and outputs are voltages measured at the terminals). For ease of 
understanding, the input-output structure preserving reduction from [29] can be 
interpreted as model reduction with preservation of external nodes [e.g., after 
reducing the circuit in Fig. 12.1, the nodes forming ports Q and P are external 
(terminals) and will also appear in the synthesized reduced model]. This way the 
reduced netlist can be easily coupled to other circuitry in place of the original 
netlist, and (re)using the reduced model in simulation becomes straightforward. 
The main drawback is that, when the reduced system matrices are dense and the 
number of terminals is large [(9(10 )], the RLCSYN unstamping procedure yields 
dense netlists. For a k dimensional reduced network with p terminals, the RLC- 
SYN synthesized netlist will generally have 0{p^k^) circuit elements. 

First, we motivate reduction and synthesis in impedance form, and show how it 
also applies for systems that are originally in admittance form. Then we explain 
model reduction via SPRIM/IOPOR, followed by RLCSYN synthesis. Finally, a 
note on numerical aspects concerning the SPRIM/IOPOR projection is given. 
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12.3.1 A Simple Admittance to Impedance Conversion 

In [29] it was shown how SPRIM/IOPOR preserves the structure of the input/ 
output connectivity when the original model is in impedance form, allowing for 
synthesis via RLCSYN without controlled sources. The emerging question is: how 
to ensure synthesis without controlled sources, if the original model is in admit- 
tance form (i.e., it is voltage driven)? We show that reduction and synthesis via the 
input impedance transfer function can be performed, also when the original circuit 
is initially voltage driven. 

Consider the modified nodal analysis (MNA) description of an input admit- 
tance^ type RLC circuit, driven by «, voltage sources: 

^ -A 





(12.4) 

where u(/) € R"' are input voltages and y(f) = Cyx(f) e M"' are output currents, 
Cy =B^. The states are Xi'(f)^ = [\{t),is{t),iL{t)f, with v(f) e M"' the node 
voltages, is{t) G K"' the currents through the voltage sources, and {^{t) e W' the 
currents through the inductors, iiy + n^ + ni = n. The riy = rii + n2 node voltages 
are formed by the rii external nodes/terminals^ and the ^2 internal nodes (assuming 
that the voltage sources may be ungrounded, n^ satisfies: «j<«i <2«j + 1). The 
dimensions of the underlying matrices are: -W G C"'"""',^ e C">'^"%^v, € C"^"""', 
^ eC"""",(f/ eC"''"",.^€C""'"'. Assuming without loss of generality that 
(12.4) is permuted such that the first rii nodes are the external nodes, the input 
voltages are determined by a linear combination of Vi:„,(f) only. Thus the fol- 
lowing holds: 

eC""^"% i#, eC"'^"% ^ = -J'„. (12.5) 

We derive the first order impedance-type system associated with (12.4). Note 
that by definition, is(/) flows out o/the circuit terminals into the voltage source 
(i.e., from the + to the — terminal of the voltage source, see also [19, Fig. 3], [15]). 



The subscript Y refers to quantities associated with a system in admittance fonn. 
^ The MNA form (12.4) corresponds to the ungrounded circuit (i.e., the reference node is 
counted within the n\ external nodes), resulting in a defective matrix pencil (Ay,Ey). For 
subsequent computations such as the construction of a Krylov subspace, the pencil (Ay,Ey) 
must be regular. Thus in (12.4), one node must be chosen as a ground (reference) node by 
removing the row/column corresponding to that node; this ensures that the regularity conditions 
(i) and (ii) from [23, page 5, Assumption 4] are satisfied. The positive definiteness of 'V, i?, ^ is 
also a necessary condition to ensure the circuit's passivity. 
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We can define new input currents as the currents flowing into tiie circuit ter- 
minals: i,„(?) = ~is{t)- Since Vi:„,(?) are the terminal voltages, they describe the 
new output equations, and it is straightforward to rewrite (12.4) in the impedance 
form: 



^ V/y(f)\ / ^ ^A fy{t)\ ^[6% 
^J dt\iLit)J^\-S'i* Oj{h{t)J {o 



B 



^ ) ( kit) ) = y(') ^ ^"^'^«' (')' '^*- = (^^'*« 



(12.6) 



where B describes the new input incidence matrix corresponding the input cur- 
rents, i,„. The new output incidence matrix is C, corresponding to the voltage drops 
over the circuit terminals. We emphasize that (12.6) has fewer unknowns than 
(12.4), since the currents ij have been eliminated. The transfer function associated 

to (12.6) is an input impedance: Yi{s) = ?^K. In Sect. 12.3.2 we explain how to 
obtain an impedance type reduced order model in the input/output structure pre- 
served form: 




A " B 



(12.7) 



where '^ , if, 'S , Sv are the reduced MNA matrices, and the reduced input 
impedance transfer function is: H(i') = prjr . Due to the input/output preservation, 

the circuit terminals are easily preserved in the reduced model (12.7). 

It turns out that after reduction and synthesis, the reduced model (12.7) can 

still be used as a voltage driven admittance block in simulation. This is shown 

next. We can rewrite the second equation in (12.7) as: {—S'* O) 

(v(/)^ Ts(f)^ hitfY=^u{t). This result together with i,„(f) = -is(0, 
reveals that (12.7) can be rewritten as: 




















^ 




(12.8) 
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which has the same structure as the original admittance model (12.4). Conceptu- 
ally one could have reduced system (12.4) directly via the input admittance. In that 
case, synthesis by unstamping via RLCSYN [29] would have required controlled 
sources [14]. As shown above, this is avoided by: applying the simple admittance- 
to-impedance conversion (12.4)-(12.6), reducing (12.6)-(12.7), and finally rein- 
serting voltage sources after synthesis (if the input-output structure preserved 
admittance reduced admittance (12.8) is needed). 



12.3.2 I/O Structure Preserving Reduction 
and RLCSYN Synthesis 

The reduced input impedance model (12.7) is obtained via the input-output structure 
preserving SPRIM/IOPOR projection [29] as follows. Let V= (V[,V[,V^)^e 
(j~.((«,+n2+«/)x*) ^g jjjg projection matrix obtained with PRIMA [19], where 

Vi G C*"""*', V2 e C^"-'"'^', V3 e C'-""'''\k>ni, i=l.. .3. After appropriate ort- 
honormalization (e.g., via Modified Gram-Schmidt [24, Chapter. 1]), we obtain: 
V, = orth(V,) e C"'""*', ki<k. The SPRIM [10] block structure preserving pro- 
jection is: V = blkdiagf Vi, V2, V3 ] £ cnx(ki+k2+k3)^ which does not yet preserve 
the structure of the input and output matrices. The input-output structure preserving 
SPRIM/IOPOR [29] projection isW==(Y ^)e C'"''"'+^-+^'' where: 



V 



3_ 
^^ /I«ix«i y \ ^ ^(ni+«2)x(«i+<:2)^ (12.9) 

Recalling (12.5), we obtain the reduced system matrices in (12.7): ''€ — W*'ifW, 

#==w*^w, 5^ = v;ifV3, ?/ = w*^/V3, ?„ = w*A, = (.^: o„,.t,)*, 

which compared to (12.5) clearly preserve input-output structure. Therefore a 
netlist representation for the reduced impedance-type model can be obtained, that 
is driven by injected currents just as the original circuit. This is done via the 
RLCSYN [29] unstamping procedure. To this end, we use the Laplace transform 
and convert (12.7) to the second order form: 

r[,i- + i+if]y(,) = ?„i,-„(..) 

where Ti,(i) = f^-^lTAvis) and f = ?,5'-'?;. 

The presentation of RLCSYN follows [29, Sect. 4], [15] and is only summa- 
rized here. In circuit simulation, the process of forming the "^j ^, if system 
matrices from the individual branch element values is called "stamping". 
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The reverse operation of "unstamping" involves decomposing entry-wise the 
values of the reduced system matrices in (12.10) into the corresponding R, L, and 
C values. The resulting Rs, is and Cs are connected in the reduced netlist 
according to the MNA topology. The reduced input/output matrices of (12.10) 
directly reveal the input connections in the reduced model via injected currents, 
without any controlling elements. The prerequisites for an unstamping realization 
procedure therefore are: 

1. The original system is in MNA impedance form (12.6). If the system is of 
admittance type (12.4), apply the admittance-to-impedance conversion from 
Sect. 12.3.1. 

2. In (12.6), no Ls are directly connected to the input terminals so that, after 
reduction, diagonalization and regularization preserve the input/output 
structure. 

3. System (12.6) is reduced with SPRIM/IOPOR [28] to (12.7) and converted to 
second order form (12.10). The alternative is to obtain the second order form of 
the original system first, and reduce it directly with SAPOR/IOPOR [3, 29]. 

4. The reduced system (12.10) must be diagonalized and regularized according 
to [29]. Diagonalization ensures that all inductors in the synthesized model are 
connected to ground (i.e., there are no inductor loops). Regularization elimi- 
nates spurious over-large inductors. These steps however are not needed for 
purely RC circuits. 



12.3.3 On SPRIM/IOPOR and Rank Loss of A 

In some cases it was observed (and shown by results in Sect. 12.4.2) that models 
reduced with SPRIM or SPRIM/IOPOR exhibit poles and zeros at 0. This section 
explains when this happens and supports theoretically the interpretation of the 
results in Sect. 12.4.2 

Proposition 1 Let Wand V3 be the SPRIM/IOPOR projection matrices (12.9), 
with full column rank. If Si = W*(f /V3 in (12.3.2) has deficient column rank, then 
the SPRIM/IOPOR reduced model (12.7) has at least a pole-zero pair at 0. 

Proof Recalling that V2 has full column rank, it is clear from (12.9) that W also 
has full column rank. Nonetheless W* G ([<"^+i<2)y{nx+n2) j^^^ ^^^^^ columns than 
rows (usually ^2 "C «2)> thus W* is column rank deficient. Hence Si = W*(f;V3 
will have deficient column rank (even if Si and V3 have full column rank). Then in 
(12.7) we have: rank(A) < #cols(A). Thus A has deficient column rank =^ A has 
at least an eigenvalue at ^ is also an eigenvalue of the pencil (A, E) and this is 
a pole at 0. The zeros of the reduced system (12.7) are determined via the 
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Rosenbrock matrix [23, Chapter 5]: A; = 



-SI 



which immediately reveals that if Si is column rank deficient, then A, also loses 
rank and will have a eigenvalue. Therefore the SPRIM/IOPOR reduced system 
(12.7) will also have a zero at 0. D 

Analytically, pole-zero pairs at are harmless since they theoretically cancel. 
Numerically this may not be the case, altering the approximation for low fre- 
quencies (as seen for instance in the result in Sect. 12.4.2). 



12.4 Numerical Examples 

Two circuits are chosen to demonstrate the applicability of the model reduction 
and synthesis procedures treated in this paper. For the single-input-single-output 
(SISO) example, one can easily provide synthesized models via both Foster and 
RLCSYN. For the multi-input-multi-output (MIMO) example, a synthesized 
model can be obtained straightforwardly with RLCSYN, thus JiLCSYN synthesis 
is preferred over Foster synthesis. 



12.4.1 SISO RLC Network 

We reduce the SISO RLC transmission line in Fig. 12.3. Note that the circuit is 
driven by the voltage u, thus it is of admittance type (12.4). The admittance 
simulation of the model reduced with the dominant spectral zero method (Domi- 
nant SZM) [16], synthesized with the Foster approach, is shown in Fig. 12.5. The 
behavior of the original model is well approximated for the entire frequency range, 
and can also reproduce oscillations at dominant frequency points. 

The benefit of the admittance-to-impedance transformation described in 
Sect. 12.3.1 is seen in Fig. 12.6. By reducing the system in impedance form with 
SPRIM/IOPOR and synthesizing (12.7) (using the second order form (12.10)) with 
RLCSYN [29], we are able to recover the reduced admittance (12.8) as well. The 
approximation is good for the entire frequency range. 
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Fig. 12.3 Transmission line from Sect.12.4.1 
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12.4.2 MIMO RLC Network 

We reduce the MIMO RLC netlist resulting from the parasitic extraction [12] of 
the coil structure in Fig. 12.4. The model has 4 pins (external nodes). Pin 4 is 
connected to other circuit nodes only via C's, which causes the original model 
(12.6) to have a pole at 0. The example shows that: (1) the SPRIM/IOPOR model 
preserves the terminals and is synthesizable with RLCSYN without controlled 
sources, and (2) structure preserving projections may affect numerical cancella- 
tions of pole-zero pairs at 0. 



Fig. 12.4 Coil structure 
from Sect. 12.4.2 




Fig. 12.5 Input admittance 
transfer function: original, 
reduced with Dominant SZM 
in admittance form and 
synthesized with Foster 
admittance 



Input admittance simulation 
Dominant SZIVI 



= 0.22513 




Original 

Recluced(Modal approx.) 
Reduced(Dominant SZM) 
Dominant SZM-synthesized 



-4-2024 

Frequency (rad/s) 
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Fig. 12.6 Input admittance 
transfer function: original and 
synthesized SPRIM/IOPOR 
model (via impedance), after 
reconnecting the voltage 
source at the input terminal 



Input admittance simulation 
SPRIM/IOPOR 




Original 
- SPRIM/IOPOR-synthesized 



-4-2024 

Frequency (rad/s) 



Fig. 12.7 Input impedance 
transfer function with "V4" 
kept: H44 for PRIMA, 
SPRIM/IOPOR and 
RLCSYN realization 
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Input impedance simulation - H^^ 

PRIMA , SPRIM/IOPOR 

n = 220,k„„„OPop, = 24 

Original 

Reduced: PRIMA 

Reduced: SPRIM/IOPOR 

-X- RLCSYN: SPRIM/IOPOR 




2 4 6 

Frequency (rad/s) 



Fig. 12.7 shows the simulation of the transfer function from input 4 to output 4, 
which clearly reflects the presence of the pole at due to the large response 
magnitude for low frequencies. By inspecting the response from input 3 to output 3 
however, we notice in Fig. 12.8 that the SPRIM/IOPOR model is less accurate 
around DC than PRIMA. The SPRIM/IOPOR (12.9) projection approximates the 
original pole at by a small pole (not at 0), but still places zeros at due to the 

dependencies created in the Rosenbrock matrix A^ (see Sect. 12.3.3). Such pole- 
zero pairs can no longer cancel numerically, affecting the approximation quality 
for low frequencies. 
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Fig. 12.8 Input imped 
transfer function with ' 
kept: H33 for PRIMA, 
SPRIM/IOPOR and 
RLCSYN realization 
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Fig. 12.9 Transient 
simulation with "V4" 
grounded: voltage measured 
at node 2 for SPRIM/IOPOR 
(from RLCSYN realization) 



Time simulation 



-V„ 



CD 
T3 

D) ■ 

CO 



SPRIM/IOPOR, avc4 grounded 
n = " " 




The alternative is to ground pin 4 prior to reduction. This will remove the pole 
at from the original model, resolve the corresponding numerical cancellation 
effects, and improve approximation around DC. As seen from Fig. 12.10, SPRIM/ 
lOPOR applied on the remaining 3-terminal system gives a better approximation 
than PRIMA for the entire frequency range. The transient simulation in Fig. 12.9 
confirms that the SPRIM/IOPOR model is both accurate and stable. With pin 4 
grounded however, we loose the ability to (re)connect the synthesized model in 
simulation via all the terminals . 
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Fig. 12.10 Input impedance 

transfer function with "V4" 
grounded: H33 for PRIMA, 
SPRIM/IOPOR and 
RLCSYN realization 



Input impedance simulation - H^^ 
PRIMA, SPRiM/IOPOR, avc4 grounded 
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Table 12.2 Summary and comparison of synthesis methods 



To Projection and Foster realization of 

summarize H(i) 



I/O pres. SPRIM and RLCSYN unstamping 



Properties Passivity guaranteed by positive 

realness of H(.s) from appropriate 
V, W 
Advantages Z need not be in MNA form 
RC, RL reduced with inodal 
approximation ^ positive 
elements 
Main MIMO realization for >2 terminals 

hurdles only for admittance H(i) and 

H(.) 
Dense MIMO netlists 
For p ports, k poles 

0{p k) circuit elements 
Positive R, L, C not guaranteed 



Passivity ensured via congruence 

transformation W, W^W = I on X in 
MNA form 
Preserved I/O and MNA structure 
Unstamping easy for MIMO via impedance 



^ may loose rank 



Reduced 'S /€ , are dense 
For p ports, k moments 

OijP'k^) circuit elements 
Positive R, L, C not guaranteed 



12.5 Conclusions and Outlook 



A framework for realizing reduced mathematical models into RLC netlists was 
developed. Model reduction by projection for RLC circuits was described and 
associated with two synthesis approaches: Foster realization (for SISO transfer 
functions) and RLCSYN [29] synthesis by unstamping (for MIMO systems). The 
approaches were tested on several examples, and a comparison is shown in 
Table 12.2. Future research will investigate reduction and synthesis methods for 
RCLK circuits with many terminals. 
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Chapter 13 

Model Reduction Methods for Linear 
Network Models of Distributed Systems 
with Sources 

Stefan Ludwig and Wolfgang Mathis 



Abstract Distributed systems with distributed sources are modeled as large 
electrical networks with linear RLC-elements, independent sources and pins for 
the connection with other network models. Often these networks are too large to be 
simulated efficiently. Model reduction can be used to reduce these networks while 
approximating the behavior at the pins for the connection with other nonlinear and 
linear network models. In the standard model reduction the independent sources 
are extracted and connected by ports with the RLC-part of the network which is to 
be reduced. This extraction only enables a weak reduction for networks with a 
large number of independent sources, as the number of ports is very high. In this 
article an efficient reduction of networks with a large number of sources is pro- 
posed by taking the waveforms of the sources into account. The method is based 
on the reduction of the dimension of the function space of the waveforms with the 
help of function approximation. The basis functions of the function approximation 
are used in the network, which results a lower number of independent sources. 
Extracting this lower number of independent sources and reducing the network 
enables a higher model reduction with state of the art techniques. With the pro- 
posed method a smaller size and higher accuracy of the reduced model can be 
achieved. The validity and efficiency of the proposed method is shown by reducing 
example models. 

Keywords Reduced order modeling • Linear RLC networks with sources • Gen- 
eralized state space systems • Terminals • Impedance • Modified model analysis 
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13.1 Introduction 

In the modeling of distributed systems very large linear RLC-networks are created. 
For these very large networks a smaller network with approximated behavior at 
some nodes of interest, typically the pins for the connection with other network 
models, is required to enable fast simulations. With the help of model order 
reduction (MOR) and network synthesis it is possible to find such a reduced 
network [3, 7, 10, 24]. Replacing the original network with the reduced network 
enables investigations with a lower effort for simulations. 

For distributed systems with distributed sources the modeled networks also 
contain, next to passive RLC-elements a high number of independent sources. 
These models of distributed systems with distributed sources typically occur in the 
IC design and verification as for example in the modeling of power grids, substrate 
couplings and conducted emission behavior [14, 19, 23, 33]. In the standard 
approach the independent sources are extracted from the reducible RLC-part of the 
network and connected by ports. With this extraction only the passive RLC-part is 
reduced and passivity can be preserved during a MOR [2, 24, 28, 32]. For net- 
works with a large number of independent sources this extraction produces a large 
number of ports of the network to be reduced. A large number of ports is a strong 
limitation for the MOR [20, 30]. In this article we will propose an additional step 
prior to the MOR to overcome this limitation. In this step the dimension of the 
function space of the waveforms of the independent sources is lowered. Realizing 
this smaller function space as network elements and replacing the original sources 
by controlled sources results in a lower number of independent sources. Extracting 
this lower number of independent sources leads to a lower number of ports of the 
reducible network. In addition, network properties like passivity and reciprocity 
are preserved in the reducible part. The behavior at the pins for the connection 
with other nonlinear elements like drivers and loads is preserved in this first port 
reduction step but the behavior of the distributed sources is approximated. The 
resulting reducible network with a lower number of ports can now more efficiently 
reduced with state of the art MOR techniques by approximating the behavior at the 
pins. 

This article is structured as follows. In Sect. 13.2 the basics of model reduction 
are shown. In Sect. 13.3 a description of a large number of independent sources as 
additional step before an efficient model reduction is proposed. The efficiency of 
the complete reduction process is shown by reducing two example networks in 
Sect. 13.4. The article is concluded in Sect. 13.5. 



13.2 Background for IVIodel Reduction of Linear Networks 

This article will focus on distributed systems modeled as linear positive real valued 
RLC-networks with independent current sources. The impedance description of the 
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network is considered. However, all presented methods are also valid for the 
admittance or hybrid case and for networks with independent voltage sources. 

For MOR the electrical circuit is described by differential algebraic equations. 
With the help of modified nodal analysis, circuit equations in the frequency 
domain of order A^ in the form of 

(sC + G)x = Bu, y = L^X (13.1) 

are generated. The real positive definite matrices C, G G M.^^^ contain the stamps 
for the capacitors, resistors and inductors, u, y are the Laplace transformed p input 
currents and output voltages respectively, which are connected with the system 
vector X by the system matrices B £ M/^^'' and L G W^^ . The resulting impedance 
transfer function is given with 

Z{s)=l7{sC + Gy^B. (13.2) 

With the help of MOR a reduced system in the form of Eq. 13.1 is built by 
approximating the transfer function behavior. MOR algorithms based on the 
projection are commonly used for the reduction of large models. The advantage of 
the projection methods is the simple implementation and the low computational 
effort for the construction of the reduced model. For the reduction a projection 
matrix T e M.^^" has to be generated. By changing the system vector x — T^x the 
reduced system matrices of order n 

C=T^CT; G = T^GT; B = T^B; L = T^L (13.3) 

are calculated [24]. If the projection matrix is real, this method preserves the 
passivity of suitably described systems [24]. For the generation of the projection 
matrix different methods are used. 

The most common method is that the projection matrix is generated by using a 
single-point Krylov subspace [24]. It can be shown, that the moments of the 
reduced system match the moments of the original system to some order [24]. As 
an extension the union of multiple Krylov subspaces at different expansion points 
is used to generate the projection matrix [7], which is in the following named MP- 
Krylov. With this type of projection matrices the order of the reduced model is 
proportional to the number of ports. For models with a large number of ports, this 
represents a strong limitation for the possible order reduction [30]. 

Another approach for the generation of the projection matrix is proposed in [27] 
with Poor Man's Truncated Balanced Realization (PMTBR) using congruence 
transformation (Section V.E in [27]). An approximation to the controllability 
Gramian is generated and an approximation to the Hankel singular values is cal- 
culated. As shown in [28, 29] the approximated Hankel singular values are used 
for order and error control. With this projection matrix the order does not directly 
depend on the number of ports. However, there is an indirect dependence on the 
number of ports and the possible order reduction [27]. 
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For investigations with electrical simulators the order reduced system is syn- 
thesized with network-synthesis algorithms as described for example in [20, 25, 35]. 

As these methods are only applicable to networks with a relatively low number 
of ports some advanced MOR methods dealing with networks with a large number 
of ports are developed. The underlying principles can be divided into two methods: 
using properties of the network structure and using input information. The methods 
presented in [9, 18] belong to the first class and use the correlation of the ports 
which is efficient if the models are regularly structured. The same class includes 
methods of dividing the network into substructures with a lower number of ports, as 
for example presented in [5]. For the class of methods where input information is 
used, for example the input correlated TBR method where the dependencies of the 
input excitations are taken into account in the model reduction process is presented 
in [31]. The reduced order analysis methods presented in [16, 17, 34] rely on 
determined inputs at all ports of the network, which is more likely for a simulation 
method as all inputs have to be determined and no connection with other nonlinear 
network elements is possible. The method presented in [21] uses the information of 
parameterized loads connected to the reducible network for a more efficient 
reduction with parametric MOR methods. In [20] a method based on the determined 
behavior of independent sources for reducing the number of ports is presented, 
which allows for the connection with other network models at some specified nodes 
but is limited to groups of internal sources with equal or proportional waveforms. 



13.3 Description of Distributed Sources 

In this article a generalization of the method of [20] is presented which is not 
restricted to groups of independent sources with proportional waveforms and 
includes waveforms that can be decomposed into or approximated by a small set of 
basis functions. Another advantage of this method is that unlike in [9, 18] no limi- 
tations for the structure ofthe linear network are necessary. Unlike in [16, 17, 34] not 
all inputs of the model have to be determined during the reduction which allows for 
the connection with other network models in simulations. The disadvantage is that 
only the number of ports where the input is determined is reduced and thereby the 
method is designated for networks with a large number of independent sources and a 
low number of pins for the connection with other network models. The method is an 
additional prior step to enhance the MOR. 

In the standard description the independent sources are extracted from the RLC- 
network and connected through ports with the remaining network (Fig. 13.1). In 
this way the reducible part of the network contains only positive real valued RLC- 
elements and thereby the network is passive and reciprocal. Passivity can be 
preserved during MOR using adequate reduction algorithms [2, 24, 28, 32] as well 
as reciprocity [10]. For networks with a large number of independent sources, the 
disadvantage of this extraction is the large number of ports which limits the order 
reduction [8, 20, 30]. 
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Fig. 13.1 Reducible network 
with extracted independent 
sources and a large number of 
ports 
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In this article a description of the sources in the network, which results in a 
lower number of independent sources while approximating the behavior at the pins 
is presented. The description of the sources is a prior step to the MOR and meets 
the following requirements: Firstly, preserve reciprocity and passivity of the 
reducible network in order to enhance network synthesis and guarantee stable 
time-domain simulations of the reduced model. Secondly, preserve properties of 
matrices C,G like (semi-) definiteness for passivity-preserving reduction [10, 24] 
and (J-) symmetry and typical block structure to enhance network synthesis after 
the reduction [10, 25, 35]. The general structure of the network is shown in 
Fig. 13.2 for an RLC-network with q independent current sources. 

Firstly we will have a look at the waveforms of the independent sources. 
A reduction of the dimension of the function space based on decomposition and 
approximation of the waveform functions is used. The q waveforms of the inde- 
pendent sources of the network are described by superposition with the weights w,_/ 
of b basis functions 



/;«£w,v/,(f) \<l<q. 



(13.4) 



/=i 



For this function approximation several methods are known in the fields of 
approximation theory, black box modeling and artificial neural networks. In the 
following some of the most often used methods are presented. 



Fig. 13.2 Passive and 
reciprocal reducible network 
with replaced sources and a 
small number of ports 
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A special and most simple case is the grouping of equal or proportional 
waveforms as presented in [20], where one representative waveform of every 
group is used as basis function. The number of groups specifies the number of 
necessary basis functions. 

For sources with waveforms described by continuous piecewise-linear (PWL) 
functions an exact decomposition into (finite) ramp functions is possible, e.g. 
described in [1]. The number of basis functions is now given by the number of 
points which are used for the waveform description. If the number of points 
describing the waveform is lower than the number of sources a reduction of the 
function space is achieved. If the PWL functions are periodic, the basis functions 
can also chosen as periodic ramp functions and thereby the number of basis 
functions is given with the number of data points in one period of the waveform. 
For discontinuous PWL functions the representations of [6] can be used. If the 
waveforms are piecewise constant functions, shifted bar functions can be used as 
basis functions. 

If waveforms are specified in a limited range of time, for example if the 
waveforms are generated from measurements or modeling in a finite time range, 
several classes of basis functions used in the field of approximation theory and 
artificial neural network modeling can be used. For every waveform the approx- 
imating basis functions can be obtained as algebraic polynomials to any degree of 
accuracy as stated in the Weierstrass approximation theorem. An example is the 
well known Taylor series expansion. Radial basis functions which are used in the 
black box modeling are also suitable as basis functions, as they are universal 
approximators [11, 26]. Another class of basis functions are the sigmoid functions, 
which are almost always capable of universal approximation [13]. For periodic or 
time- or band-limited waveforms the approximation with a fourier series expan- 
sion can be used. The basis functions are then trigonometric functions. The series 
converges almost everywhere to the waveform functions which is stated by the 
Carleson-Hunt theorem and thereby a finite number of basis functions is sufficient. 

The discrete wavelet transform, which is widely used in signal- and data pro- 
cessing can also be used for the reduction of the dimension of the function space. 
With this transformation one dimensional wavelets are used as basis function for 
the approximation as for example presented in [22]. 

For this approximative methods the number of necessary basis functions depends 
on the desired accuracy of the approximation of the waveforms. For the number of 
basis functions a tradeoff between the number of basis functions and the desired 
accuracy has to be found. For a higher accuracy a higher number of basis functions is 
necessary and the reduction of the dimension of the function space is lower. 

Any other method of decomposition into or approximation by a finite number of 
basis functions is feasible, as long as the number of basis functions b is smaller 
than the number of independent sources q in the network. As for most approxi- 
mation methods the number of basis functions depends on the desired accuracy 
and the number of basis functions has to be lower than the number of independent 
sources waveforms this method is most suitable for networks with a large number 
of independent sources. 
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The inclusion of the reduced dimension function space in the network model in 
the form of Fig. 13.2 is divided into the following four steps. Independently on the 
decomposition or approximation method for every basis function an independent 
current source with the basis waveform is added in the first step in the network 
model, resulting in b additional independent sources. In the second step all 
q original independent current sources are replaced by current controlled current 
sources (CCCS), controlled by the b additional independent sources. The gains of 
the CCCS are given by the weights Wjj in Eq. 13.4. In the third step, voltage 
controlled voltage sources (VCVS), connected with the additional independent 
sources, are used for the preservation of passivity and reciprocity if the gains are 
chosen according to the gains of the CCCS, as is shown later on in this section. In 
the last step, only the b additional independent current sources are extracted and 
connected through ports with the reducible part of the network. In this way a 
reducible network with a reduced number of ports is generated. The resulting 
reducible network also contains controlled sources in addition to the RLC- 
elements and is called network with replaced sources throughout the article. 

In the following the description of the reducible network with replaced sources 
and a lowered number of ports is given, whereas the system matrices C, G have the 
typical properties of pure RLC-networks like (J-)symmetry, (semi-)definiteness 
and block structure. An RLC-network with A^ nodes, p pins and q independent 
current sources, which waveforms are the superposition of b basis waveforms, is 
considered. For the generation of the system matrices of the network with replaced 
sources the incidence matrices Kc,KR,Kz,,Kcccs,Kpin for capacitors, resistors, 

inductors, CCCSs and pins are used. The matrices Pcccs G W'', Pvcvs e M'"'^ 
contain the parameters of the gains of the controlled sources. The branch consti- 
tutive relations are given with 



ic = sCuc, iR = Gur, ul = sLiL 

icccs = Pcccsivcvs, uvcvs — Pvcvs ucccs- 



(13.5) 



With Kirchhoff's Current Law Ki = we get 



Kc 













-Kcccs 




K 



pin 



ifi 
icccs 

Ipin 

ivcvs 

\ h / 



(13.6) 



with I as the identity matrix. Notably the network is divided into two parts which 
are not interchanging charges. The first part contains the RLC-elements, the 
CCCSs and the pins and the second part the VCVS and the additional independent 
sources. The voltages are defined as 



232 



S. Ludwig and W. Mathis 



/ Uc \ 




/ K^ 


Ur 




K^ 


Ul 




Kl 


ucccs 




'^cccs 


Upin 




Kpin 



V UvCVS / 



V 








1/ 



■■vcvs 



(13.7) 



where u"^ contains the potentials of the first part of the network and Uycvs contains 
the potentials of the second part of the network. We define a system of circuit 
equations with 
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(13.8) 



(13.9) 



Since the currents i^ and ipjn are the inputs u of the system, the voltages u*^ and 
inductor currents ii form the system vector x and the voltages across the additional 
independent sources Uvcvs and at the pins Upin are the outputs y, the system has the 
form of equations (13.1) with C,G having the typical MNA properties of pure 
RLC-networks without controlled sources. The controlled sources only appear in 
the matrices B and L. The gains Pycvs of the VCVSs are now chosen with 



vcvs 



cccs 



(13.10) 



and thereby the condition B = L holds. With this description it is shown, that the 
controlled sources which normally influence the matrix G can be described in a 
way, where only B,L are changed and properties of C,G like definiteness, sym- 
metry and structure of pure RLC-networks are conserved, enhancing the reduction. 
This system of the network with replaced sources now has, in addition to the 
ports for the pins, only b ports for the independent sources which is less than the 
number of ports q if all sources were extracted. In the following, it is shown that 
this reducible network is passive and reciprocal. 

Theorem 1 The network with replaced sources is passive. 

Proof For the proof of passivity, the conditions for passivity have to be consid- 
ered. The necessary and sufficient condition for the passivity of the impedance 
transfer function Z(i) of the model is positive-realness [15]: 
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1. Z(i*) — Z*{s) where is the conjugate complex operator 

2. Z{s) is positive, that means a'^[Z{s) + Z"{s)]a>0, with H as Hermitian 
operator is satisfied for all complex s with Re{s) > and for all finite complex 
vectors a 

Since the matrices stay real while replacing the independent current sources, the 
first condition is still fulfilled. 

The second condition of positive-realness can be written as 

a"[Z{s) + Z"{s)]a = a"l.^[{sC + G)-'+{sC + G)-^]Ba. (13.11) 

With B = L a finite complex vector b = Ba = La is defined and the transfer 
function with replaced independent current sources is positive 

b^[(.vC + Gy^ + {sC + Gr"]h > 0. (13.12) 

Since the controlled sources are described in a way that does not influence the 
system matrices they stay positive definite and real while replacing the indepen- 
dent sources. This means that the passivity of the network is conserved, as long as 
L = B holds, which is guaranteed by Eq. 13.10. D 

Theorem 2 The network with replaced sources is reciprocal. 

Proof For the proof of reciprocity, the condition for reciprocity has to be con- 
sidered. The condition for a model to be reciprocal is that the impedance transfer 
function is symmetrical [4, 15]: 

Z{s)^Z^{s). (13.13) 

This condition can be written as 

L^(5C + G)"'b = B^(.jC + G)"^L, (13.14) 

which is fulfilled since system matrices C and G can be written in symmetric form 
by multiplying C, G with an appropriate diagonal matrix J and with Eq. 13.10 the 
condition B = L still holds. D 



13.4 Examples 
13.4.1 RCI-Grid 

As an example, an RCI-grid network as shown in Fig. 13.3a which is widely used 
in power grid simulations [14, 23] is utilized. All reductions and simulations of this 
example are performed on an actual personal computer platform.' The example 
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(a) Section of the network structure 



^ X 
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si 
|i 
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3 '-1 



01234560123456 
time (s) time (s) 

(b) Some of the 55 current sources waveforms 



Fig. 13.3 Example 50 x 50 RCI-grid network, a Section of the network structure, b Some of the 
55 current sources waveforms 




0123456 0123456 0123456 
time (s) time (s) time (s) 

Fig. 13.4 The three basis waveforms of the RCI-grid example using decomposition 



grid with 50 x 50 nodes contains 4900 resistors and 2500 capacitors and two pins 
at two opposite corners for the connection of nonlinear driver models. 

The values of the elements are randomly chosen in the ranges 1 ± 0.5 Q and 
0.1 ± 0.05 F for the resistors and capacitors, respectively, q — 55 independent 
current sources with PWL-waveforms are connected between arbitrary chosen 
nodes in the network and the ground node. Some PWL-waveforms used in the 
example are shown in Fig. 13.3b. All PWL-waveforms are decomposed into the 
three basis-waveforms shown in Fig. 13.4. 

With the standard method of extracting all independent sources the system has 
57 ports. Only three independent sources are necessary if the waveforms of the 
independent sources are decomposed into the three basis-waveforms shown in 
Fig. 13.4 and the network with replaced sources of Sect. 13.3 is used. This results 
in five ports of the reducible part of the network. By this network alteration the 
behavior at the two pins of the model is not changed as the decomposition is exact. 
The transfer function of the system of the network with replaced sources can be 
calculated almost four times faster before the reduction with MOR due to the 
lower number of ports (Table 13.1). The network with extracted sources as well as 
the network with replaced sources are reduced with the PMTBR algorithm as 
described in Sect. 13.2. The 25 frequency points used in the reduction process are 
equally distributed in the frequency range 10"^ <Si < 10^. The reduced systems 
are generated within less than one minute, which is not critical as the model 
reduction process is done offline before the simulation. In Fig. 13.5 the approxi- 
mated Hankel singular values of both systems are shown. The singular values of 
the system with replaced sources decay faster. This means, that for a given 
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Fig. 13.5 Hankel singular 
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Table 13.1 Compaiison of reduction results of the 50 x 50 RCI-grid 
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reduced with 
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maximum error the network with replaced sources can be more efficiently reduced 
than the network with extracted sources. Likewise for a given reduced order the 
reduced model of the network with replaced sources is more exact than the 
reduced model of the network with extracted sources. Firstly both systems are 
reduced to a maximal singular value of 10~ , resulting in an order of « = 40 for 
the system with replaced sources and n = 165 for the system with extracted 
sources. The MATLAB -speed-up of both reduced systems for the calculation of 
the transfer function in several frequency points is shown in Table 13.1. Both 
reduced systems with the maximal singular value of 10~ show a good agreement 
with the original system, as shown for the transfer function Zn in Fig. 13.6. 
However, since the reduced system of the network with replaced sources is 
smaller, it can be simulated faster than the reduced system of the network with 
extracted sources (52x in comparison to 9.4x, Table 13.1). Secondly, both sys- 
tems are reduced to the same order of « = 40. The speed-up of the system of the 
network with extracted sources is almost as high as for the system of the network 
with replaced sources (42x and 52x, Table 13.1). Nevertheless, the reduced 
system of the network with replaced sources is more accurate (Fig. 13.6) than the 
reduced system of the network with extracted sources. 

With this example it is shown that the proposed replacing of independent 
sources as an additional step before the reduction can lead to smaller and therefore 
faster models than the standard method of extracting the sources. It was also 
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Fig. 13.6 Magnitude and phase of transfer function Zn of 50 x 50 RCI-grid 



Fig. 13.7 Section of the IC 
conducted emission network 




shown that for the same size of the reduced system our proposed method can lead 
to more accurate reduced models. 



13.4.2 IC Conducted Emission Model 

As a second example for a network with a large number of independent sources an 
industrial model describing the IC conducted emission (ICEM) behavior of the 
Infineon 32 Bit microcontroller TC1796 is utilized [33]. All reductions and sim- 
ulations of this example are performed on an actual server platform.^ The network 
is generated with the Infineon Technologies AG Germany EXPO-Tool [12]. In 
Fig. 13.7 a small section of the network is shown. The model contains more than 
35,000 passive RLC-elements and 328 independent current sources. For the con- 
nection with the printed circuit board model the network contains 61 pins. The 
circuit equations of the complete model have an order of 25,062. 

If every independent source is extracted and connected through a port with the 
reducible part of the network, the circuits equations contain 389 ports including 
the 61 pins. With respect to their prescribed waveforms the current sources of the 
network are grouped into eight groups and therefore eight additional independent 
sources are necessary. Replacing the 328 independent sources with CCCSs and 
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Fig. 13.8 Magnitude and phase of transfer function Z33 of ICEM 



extracting the eight additional independent sources, as described in Sect. 13.3, 
results in a reducible net'work ■with only 69 ports, including the 61 pins. The 
network with extracted as well as the network with replaced sources are reduced 
with the MOR algorithms described in Sect. 13.2. The results for the order 
reduction are presented in Table 13.2. All models show a close alignment in the 
frequency range of interest as shown for the arbitrary chosen transfer function Z33 
in Fig. 13.8. 

For the MP-Krylov reduction two expansion points, si = 2nlO^,S2 ~ 27tl0^ are 
used where in every point one moment is to be matched. For preservation of 
reciprocity the projection matrices are split according to [10]. As the reduced 
models are generated once and used often the necessary time of below one minute 
for the reduction process is not critical. As there is a direct dependency of the order 
of the reduced system on the number of ports with MP-Krylov, there is a signif- 
icant difference in the reduced order for both systems. Since the system of the 
network with extracted sources has more than five times as many ports as the 
system of the network with replaced sources, the reduced model is over five times 
the size of the reduced model of the network with replaced sources. This leads to 
much faster simulations of the reduced model of our proposed method for the 
calculation of the transfer function in several frequency points with MATLAB 
(132x in comparison to 2.8x, Table 13.2). 
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Table 13.2 Comparison of the reduction results of the ICEM 
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For PMTBR 20 frequency points are equally distributed in the range lO' < 
Si < 10'°. For the reduction prior to the simulation of the systemof the network with 
extracted sources around five minutes and for the system of the network with 
replaced sources around two hours are necessary. With reduction using PMTBR the 
reduced order does not directly depend on the number of ports and the reduced order 
is chosen according to the Hankel singular values. The approximation to the Hankel 
singular values calculated during PMTBR is shown in Fig. 13.9. It can be seen, that 
the singular values for the network with replaced sources decay faster than for the 
network with extracted sources. The largest singular value is set to 10~', which 
results in a reduced order of 201 and 590 for the replaced and extracted sources 
network, respectively. With this MOR algorithm the reduced model of the network 
with replaced sources is three times smaller than the reduced model of the network 
with extracted sources and the achieved speed-up in calculating the transfer function 
behavior with MATLAB is 188 x in comparison to 33 x (Table 13.2). 

With this IC emission model example is has been shown that the proposed 
network with replaced sources can lead to much smaller and therefore faster 
reduced models than the standard method of extracting all sources. 



13.5 Conclusion 



In this article an efficient description of independent sources in electrical networks 
that can achieve a higher model reduction is presented. The dimension of the 
function space of the waveforms of the sources is reduced with the help of function 
decomposition and approximation. A network with a reduced number of inde- 
pendent sources, whereas the behavior at the pins of the network is preserved is 
generated. This network can be more efficiently reduced with practically all model 
reduction techniques. The validity and superior efficiency of the proposed method 
compared to standard methods is shown by reducing two example networks. 
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Chapter 14 

Structure Preserving Port-Hamiltonian 

Model Reduction of Electrical Circuits 

Rostylav V. Polyuga and Arjan J. van der Schaft 



Abstract This paper discusses model reduction of electrical circuits based on a 
port-Hamiltonian representation. It is shown that by the use of the Kalman 
decomposition an uncontrollable and/or unobservable port-Hamiltonian system is 
reduced to a controllable/observable system that inherits the port-Hamiltonian 
structure. Energy and co-energy variable representations for port-Hamiltonian 
systems are defined and the reduction procedures are used for both representations. 
These exact reduction procedures motivate two approximate reduction procedures 
that are structure preserving for general port-Hamiltonian systems, one reduction 
procedure is called the effort-constraint reduction methods. The other procedure is 
structure preserving under a given condition. A numerical example illustrating the 
model reduction of a ladder network as a port-Hamiltonian system is considered. 



14.1 Introduction 

Port-based network modeling of physical systems (both in electrical and mechan- 
ical domains) leads directly to their representation as port-Hamiltonian systems 
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which are, if the Hamiltonian is non-negative, an important class of passive state- 
space systems. At the same time network modeling of physical systems often leads to 
high-dimensional dynamical models. Large state-space dimensions are obtained as 
well if distributed-parameter models are spatially discretized. Therefore an impor- 
tant issue concerns model reduction of these high-dimensional systems, both for 
analysis and control. The goal of this work is to show that the specific model 
reduction techniques of linear port-Hamiltonian systems preserve the port-Ham- 
iltonian structure, and, as a consequence, passivity, and demonstrate the possibility 
of applying those techniques to electrical circuits as port-Hamiltonian systems. 

Port-Hamiltonian systems are endowed with more structure than just passivity. 
Other important issues like interconnection between port-Hamiltonian systems, the 
presence of conservation laws and energy dissipation are also reflected by the port- 
Hamiltonian structure. In Sect. 14.2 we provide a brief overview of linear port- 
Hamiltonian systems. General theory on port-Hamiltonian systems can be found in 
[12]. We will show by applying the Kalman decomposition in Sect. 14.3 that the 
reduction of the dynamics of an uncontrollable/unobservable linear port-Hamilto- 
nian system to a dynamics on the reachability/observability subspace preserves the 
Port-Hamiltonian structure. This result holds both for energy and co-energy variable 
representations of linear port-Hamiltonian systems. The co-energy variable repre- 
sentation of port-Hamiltonian systems is considered in Sect. 14.4. It is shown in 
Sect. 14.4 that the reduced models in the co-energy coordinates take a somewhat 
"dual" form to the reduced models obtained in the standard energy coordinates. 

Within the systems and control literature a popular and elegant tool for model 
reduction is balancing, going back to [8]. One favorable property of model 
reduction based on balancing, as compared with other techniques such as modal 
analysis, is that the approximation of the dynamical system is explicitly based on 
its input-output properties. Balancing for port-Hamiltonian systems is considered 
in Sect. 14.6, see also [13]. We will apply two structure preserving model 
reduction procedures in Sect. 14.6 to linear port-Hamiltonian systems and show 
that the reduced order models are again port-Hamiltonian. One reduction proce- 
dure is called the effort-constraint reduction method. The other procedure is 
structure preserving under a given condition. Preliminary results of this paper are 
reported in [11] using scattering coordinates. Similar reduced order port-Hamil- 
tonian models are obtained in [6, 7] employing perturbation analysis. In Sect. 14.7 
we consider numerical simulations of a ladder network, and apply the effort- 
constraint method and the balanced truncation method in order to obtain reduced 
order models and compare the results. 



14.2 Linear Port-Hamiltonian Systems 

Port-based network modeling of physical systems leads to their representation as 
port-Hamiltonian systems (see e.g. [4, 9]). In the linear case, and in the absence of 
algebraic constraints, port-Hamiltonian systems take the form (see [11-13]) 
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X = {J - R)Qx + Bu, , . 

with H{x) = ^x^Qx the total energy (Hamiltonian), Q = Q^ >Q the energy matrix 
and R = R^ >0 the dissipation matrix. The matrices / — —J^ and B specify the 
interconnection structure of the system. By skew-symmetry of J and since R is 
positive semidefinite it immediately follows that 

— -x^Qx = u'^y - x^QRQx < Jy. (14.2) 

Thus if (2 > (and the Hamiltonian is non-negative) any port-Hamiltonian system 
is passive (see [12, 16]). The state variables x e K" are also called energy vari- 
ables, since the total energy H{x) is expressed as a function of these variables. 
Furthermore, the variables u S W\y e R'" are called power variables, since their 
product u^y equals the power supplied to the system. 

In the sequel we will often abbreviate J~R to F^J~R. Clearly 

F + F'^<0. (14.3) 

Conversely, any F satisfying (14.3) can be written as J — R as above by 
decomposing F into its skew- symmetric and symmetric part 

1 

2 ^ (14.4) 

R^~-{F + F^). 

Two special cases of port-Hamiltonian systems correspond to either R = or 
J — 0. In fact, if R = (no internal energy dissipation) then the dissipation 
inequality (14.2) reduces to an equality 

—x^Qjc^u^y. (14.5) 

dt2 ^ -^ ^ ' 

In this case the transfer matrix G{s) = B^Q{sI ~ JQ) B of the system (for 
invertible Q) satisfies 

G{s) = -G^(-5). (14.6) 

Conversely, any transfer matrix G{s) satisfying G{s) ~ ~G^{—s) can be 
shown to have a minimal realization 

x = JQx + Bu, , , 

y = B^Qx, ^^^-^^ 

(with in fact Q being invertible again). 

The other special case corresponds to / = 0, in which case the system takes the 
form 
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-RQx 
B^Qx, 



Bu, 



(14.8) 



with transfer matrix G{s) — B^Q{sI + RQ) B satisfying (for invertible Q) 

G{s)^G\s). (14.9) 

Conversely, any transfer matrix G{s) satisfying (14.9) is represented by a 
minimal state-space representation (14.8) with Q invertible, where, however, R 
need not necessarily be positive semidefinite. 

In these two special cases, either R — Q or J — Q, there is a direct relationship 
between controllability and observability properties of the port-Hamiltonian 
system. 

Proposition 1 Consider a port-Hamiltonian system (14.7) or (14.8), and assume 
delQ 7^ 0. The system is controllable if and only if it is observable, while the 
unobservability subspace jV is related to the reachability subspace Sk by 



jV = M^ 



;i4.io) 



with _L denoting the orthogonal complement with respect to the (possibly indefi- 
nite) inner product defined by Q. 

Proof For any port-Hamiltonian system (14.1) with F — J — R we have 



B^Q 

B^QFQ 

B^QFQFQ 



B 



F^QB : F^QF^QB 



Q- 



;i4.ii) 



Since the kernel of the matrix on the left-hand side defines the unobservability 
subspace, while on the right-hand side the image of the matrix preceding Q defines 
the reachability subspace \f F^ = F or F^ = ~F, the assertion follows. D 

Nevertheless, in general controllability and observability for a port-Hamiltonian 
system are not equivalent, as the following example shows. 

Example 1 Consider a port-Hamiltonian system 



"xi' 

y = 


1 


0] 


1 1 ■ 
1 -1 

■ 1 - 
-1 : 


1 

I 


1 


1 

' Xl' 


-r 

2 

1 




' Xl' 
X2 


+ 


T 




^14.12) 



corresponding to / 
not controllable. 



and/? 



1 
1 



. The system is observable but 
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14.3 The Kalman Decomposition of Port-Hamiltonian Systems 
14.3.1 Reduction to a Controllable Port-Hamiltonian System 



Consider a port-Hamiltonian system on a state space X which is not controllable. 



Take linear coordinates x - 
reachability subspace M (Z ?£: 



such that vectors of the form 



span the 





'x{ 


= 


■^11 

Fix 


Fn 

Fl2_ 


en 

.221 


Qn 
Qii. 


' x{ 


+ 


'B{ 
Bi. 


y^\B] Bi;] 


Gil 212" 


'x{ 












.621 


0.11 \ 


.^1. 









By invariance of ^ (see e.g. [10]) this implies 



M, 



(14.13) 



FiiQi 



F2iQi\ — 0, 
B2 = 0. 



(14.14) 



It follows that the dynamics restricted to ^ is given as 

Fi2Q2i)xi +Biu, 



(FuQu 



X] 



y^B]QuXi. 



(14.15) 



Now let us assume that F22 in (14.14) is invertible. Then it follows from (14.14) 
that 221 = -^22' -^21211- Substitution in (14.15) yields 



Xl = (Fii --F'l2/^22^2l)2ll-«l +BiU, 

y^BjQuXi, 



;i4.16) 



which is again a port-Hamiltonian system. Indeed, F + F^ <0 implies that the 
Schur complement F = Fn — F12F22F11 satisfies F + F^ <0. 

Remark 1 Note that F is skew- symmetric if F is skew-symmetric, and is sym- 
metric if F is symmetric. 

Remark 2 The Schur complement of a general F with singular F22 is not defined. 
Nevertheless, it is still possible to extend the definition of the Schur complement of 
F to the case where F22 is singular if F is symmetric, which corresponds to the 
purely damped port-Hamiltonian systems (14.8). For details see Lemma 1 in the 
appendix at the end of the paper. 
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14.3.2 Reduction to an Observable Port-Hamiltonian System 

Consider again a port-Hamiltonian system (14.1) and suppose the system is not 



observable. Then there exist coordinates x — 



subspace J^ is spanned by vectors of the form 

again [10]) it follows that 

F\iQi2 + F12Q22 = 0, 
BlQn+BlQ22 = 0. 

The dynamics on the quotient space '3C j .jV is 
y = 5[eiiXi+B[e2iXi. 



such that the unobservability 
By invariance of jV (see 

(14.17) 



;i4.i8) 



Assuming invertibility of Q22 it follows from (14.17) that F\2 = ~F\\Q\2Q22 
and Bl = -B[Qi2Q22'- Substitution in (14.18) yields 



x\ = Fn{Qu - Qi2Q22Q2i)xi +Biu, 
y^B\iQu-QnQ22Q2i)xu 



(14.19) 



which is again a port-Hamiltonian system with Hamiltonian H{xi) = 

\AiQn - QnQ.2lQi\W 

Remark 3 Note that (Qu - Q12Q22Q21) > if Q > 0. 

Remark 4 Since Q is symmetric the definition of the Schur complement of Q can 
be extended to the case that Q22 is singular. For details see Lemma 1 in the 
appendix at the end of the paper. 



14.3.3 The Kalman Decomposition 



It is well known that a linear system x ~ Ax + Bu, y — Cx can be represented in a 
suitable basis as (see [10, 17]) 

'An An ' 
A22 

A31 A32 A33 A34 

A42 A44 

with 3^ = 3^1 X iF2 X ^"3 x ^^4, where S^i is the part of the system that is both 
controllable and observable, f£2 is uncontrollable but observable, :Tj, is control- 
lable but unobservable, while ^4 is uncontrollable and unobservable, that is 





'Bl' 




\cj] 


, B = 



B3 


,c = 


CI 
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J/" = ^3 X 'dC^, 






.^ = J^i X SC^. 


Writing out 








"All Ai2 

A22 

A31 A32 

. A42 





A33 




■ 



A34 
A44. 


= 


F\\ F12 Fn -F14I 

F2\ F22 F23 F24 

F31 Ft,2 /^33 ^34 

.-F41 F42 F43 F44J 



;i4.20) 



Gil 612 Gi3 ei4 

Q21 622 Q23 624 

G31 232 Q33 634 

. G41 242 243 244. 



this implies that the blocks of the F and Q matrix satisfy 



(fl)Fil2l3+ ^12223 
(^)^ll2l4 + ^12224 
(c)f2l2ll +^22221 - 

{d)F2\Qn +F22Q23 
{e)F2iQH + F22Q24 
{f)F4iQu + F42Q21 ' 
{8)F4\Qi3 + F42Q23 

and similarly by writing out 



-^13 233+^14243=0, 
-f 13 234+^14244 = 0, 

^23231 + ^24241 — 0, 
' F2iQii + ^24243 = 0, 
^23 234 + ^24244 = 0, 

^43231+^^44241 =0, 
■ f 43 233 + -^44243 = 0, 



B 



Bl 



2ii 2i2 2i3 

221 222 223 

231 232 233 

241 242 243 



2l4 
224 
234 
244 



we obtain 



5[2l3+B(233=0, 
fi[2l4+B[234 = 0. 



^14.21) 



[Ci C2 0], 



;i4.22) 



The resulting dynamics on S^\ (the part of the system that is both controllable 
and observable) can be identified in port-Hamiltonian form, by combining the 
previous two reduction schemes corresponding to controllability and observability. 
Indeed, application of Sect. 14.3.2 yields the following observable system on 

a:i X X2 



y=[B\ 



Fu 

F21 

OI2 



^12" 

^=^22 


2 


Xi 
X2 


+ 


■fli" 




Xi 






X2 


: 









^14.23) 



where 



2 



2ii 2i2 
221 222 



2l3 2l4 
223 224 



233 234 
243 244 



231 232 
241 242 



(14.24) 
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Next, application of Sect. 14.3.1 to (14.23) yields the following port-Hamil- 
tonian description of the dynamics on SC\ 



ii = (Fx\-Fx2F^lF2{)QixXi+Bxu, 
y = BjQuXu 



Tn (14.25) 



having the same transfer matrix as the original system (14.1). 

Further analysis (using the well-known matrix inversion formula) yields 

Qn = Qu - Gl3(e33 - 234644' 243)"' G31 

+ Q14Q44 243(633 - G34G44 G43) 631 ,^^ 26) 

+ Gl3G3tG34(G44 - G43e33'e34)"'G41 
-Gl4(G44-G43G3l'G34)"'G41. 

Remark 5 By first applying the procedure of Sect. 14.3.1 and then applying the 
procedure of Sect. 14.3.2 for zero initial conditions it can be shown that we obtain 
the same port-Hamiltonian formulation. 



14.4 The Co-Energy Variable Representation 

In this section we assume throughout that the matrix Q is invertible. This means 
that 

e^Qx (14.27) 

is a valid coordinate transformation, and the port-Hamiltonian system (14.1) in 
these new coordinates takes the form 

e = QFe + QBu, F = J ~ R, ^^^ ^g) 

y — B^e. 

Since e ^ Qx = ^ (x), with H{x) — \x^Qx the energy, the variables e are usually 
called the co-energy variables. 

Example 2 Consider the LC-circuit in Fig. 14.1, with q the charge on the 
capacitor and </» 1,(^2 t^he fluxes over the inductors L\,L2 correspondingly. The 
energy (in the case of a linear capacitor and inductors) is given as 

H{q,4>„^2)=^q^+^y,+:^y^, (14.29) 

and X = [q,4>i, (f)2\ ^re the energy variables, in which the system takes the port- 
Hamiltonian form 
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(14.30) 



with u,y being the voltage across and the cuirent through the voltage source. The 
co-energy variables 



r qic 1 




\vc^ 


4>y/U 


= 


hi 


UilLi] 




[hi] 



are the voltage over the capacitor and the currents through the inductors, leading to 
the following form of the dynamics 



;i4.3i) 
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Note that 



d 1 



dt2 



e^Q-'e 



-y{F 



F^)e 



e^Bu 



-e^Re + u^y, 



[U32) 



and thus if (2 > then V{e) = |e^2"'e (the Legendre transform of //(x) =^x^Qx) 
is a storage function of (14.28). V{e) is called the co-energy of the system, which 
is in this linear case equal to the energy {V{Qx) = H{x)). 

A main advantage of the co-energy variable representation of a port-Hamilto- 
nian system is that additional constraints on the system are often expressed as 
constraints on the co-energy variables (see also Sect. 14.6) 

The reduction of the port-Hamiltonian system to its controllable and/or 
observable part takes the following form in the co-energy variable representation. 
Interestingly enough, the formulas take a somewhat "dual" form to the formulas 
obtained in the energy variable representation. 
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Consider the system (14.28) in co-energy variable representation. Take linear 



coordinates e = 
of the form 



such that the reachability subspace ^ is spanned by vectors 



ei 




Gil Qn 




.^2. 
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ei 





Fn 
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Fn 
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Bi 
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u, 



By invariance of ^ this implies 

G21^11 
G2l5l 



G22-F21 — 0, 

G22B2 = 0. 



Hence the dynamics restricted to 3i equals 

ei = {QuFu+QnF2i)ei + {QuBi+QnB2)u 

= (GlI - Gl2G22'G2l)^Iiei + (Gil - Gl2G22'G2l)5lM, 

y = B\eu 



^14.33) 



;i4.34) 



;i4.35) 



which is a port-Hamiltonian system in co-energy variable representation, with 
energy matrix G = Gii ~ Gi2G22'G2i> and interconnection/damping matrix Fn. 
Notice that these formulas are dual to the corresponding formulas (14.16) for the 
controllable part of the system in energy variable representation, where 
the resulting interconnection/damping matrix is a Schur complement, while the 
resulting energy matrix is Gii- This duality is associated with the Legendre 
transform of the Hamiltonian H{x). 



Analogously, take linear coordinates e 
subspace J^ is spanned by the vectors 



such that the unobservability 



e2 
. This implies 



Gll^l2 + Gl2^22 = 0, 
52-0, 

leading to the observable reduced dynamics 

ci == (Gii^ii + Gi2^2i)ei + Gii^i" 

= Gii(^ii -Fi2F22F2i)ei +QuBiu, 
y^B\ei. 



;i4.36) 



^14.37) 



Combination of the above leads to a similar Kalman decomposition as in the 
energy variable representation. 



14 Structure Preserving Port-Hamiltonian Model Reduction of Electrical Circuits 251 

Remark 6 To extend the definition of the Schur complements of Q, F for singular 
022,^22 see Remarks 2, 4. 



14.5 Balancing for Port-Hamiltonian Systems 



Definition 1 The controllability and observability function of a linear system are 
defined as 



u 

Le(^o) = min - / \\u{t)fdt, x{~oo)=Q, .t(0) = xo (14.38) 
«eL2(-oo,0)2 J 



and 



L„{xq) = ^ / II y{t) f dt, uit) = 0, .t(0) - xo, (14.39) 



respectively. 

The value of the controllability function at xq is the minimum amount of input 
energy required to reach the state xq from the zero state, and the value of the 
observability function at xq is the amount of output energy generated by the 
state Xq. 

Tlieorem 1 Consider a linear time invariant (LTI) asymptotically stable 
system [8] 

X = Ax + Bu, 

(14.40) 
y — Cx. 

Then L,{xo) = ixJW-'xo and Lo{xq) = \xIMxq, where W = j^ e^'BB^e^^'dt is 
the controllability Gramian and M — Jj, e^ 'C^Ce^'dt is the observability Gra- 
mian. Furthermore W and M are symmetric and positive definite, and are unique 
solutions of the Lyapunov equations 

AW + WA^ = -BB^ (14.41) 

and 

A^M + MA = -C^C (14.42) 

respectively . 
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In the port-Hamiltonian case Eqs. (14.41) and (14.42) specialize to 

{J - R)QW + WQ{J - rY ^ -BB^ (14.43) 

and 

Q{J - rY M + M{J - R)Q = -QBB'^ Q. (14.44) 

Sometimes it is useful to proceed using so-called scattering representation for 
port-Hamiltonian systems (motivated by electrical domain), where controllability 
and observability Gramians are related to the energy matrix Q as M <Q< W^^ , 
and Hankel singular values Gj are all <^ 1 (see [11, 13]). 

Nevertheless in this paper we proceed without using scattering coordinates. 

The balancing coordinate transformation S,x — Sxf,, where x^ denotes balanced 
coordinates, clearly preserves the port-Hamiltonian structure of the system (14.1): 

Xb = {Jb - Rb)QbXh + BbU, (]AA=i\ 

y = BlQbXh, 

where S^^RS^^ = Rb = Rl>0 is the dissipation matrix, S^^JS^^ = Jb = —Jj, is 
the structure matrix and S^QS = Qb = Ql^Ois the energy matrix in the balanced 
coordinates Xb- In this case, B/, = S^B. 

Similarly, the port-Hamiltonian structure is preserved applying balancing coor- 
dinate transformation T,e~Tei,, to the port-Hamiltonian system (14.28) in 
co-energy coordinates. 

Now bringing the system (14. 1) into a balanced form where W — M (see [8, 14]) 
and computing the square roots of the eigenvalues of MW which are equal to the 
Hankel singular values (see [3]) provides us the information about the number of 
state components of the system to be reduced. These state components require large 
amount of the incoming energy to be reached and give small amount of the out- 
going energy to be observed. Therefore they are less important from the energy 
point of view and can be removed from the system (see also [1]). 



14.6 Reduction of Port-Hamiltonian Systems in the 
General Case 

For a general port-Hamiltonian system in energy (14.1) or co-energy (14.28) 
coordinates with no uncontrollable/unobservable but with "hardly" controllable/ 
observable states we may apply balancing as explained in Sect. 14.5 and use one 
of the following structure-preserving reduction techniques. Since the techniques 
considered apply to the port-Hamiltonian systems in balanced coordinates, for the 
sake of simplicity in this section we skip the subscript 'b' writing x,e,J,R,Q,B 
instead of Xb, eh, Jb,Rb,Qb,Bb. 
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14.6.1 Effort-Constraint Reduction 

Consider a full order port-Hamiltonian system (14.1). We balance the system 
(14.1), but now in co-energy coordinates (and thus with another change of coor- 
dinates (14.27), obtaining the following balanced representation of our system 



e = Q{J ~ R)e + QBu, 
y = B^e, 



J (14.46) 



where the lower part of the state vector e 



is the most difficult to reach and 



to observe. 

Consider the system (14.1) again, but now in the coordinates where the system 
(14.46) is balanced 



x= {J - R)e + Bu, 
y = B^e. 



(14.47) 



A natural choice for the reduced model would be a model which contains only the 
e\ dynamics since the lower part of the state vector 62 is much less relevant from 
the energy point of view 

62 = G21X1 + 622^2 « 0. (14.48) 

Therefore the reduced system takes the following form 

xi = (7ii -Ru)ei +Biu 

^ {Jn - Rn){QuXi + Q12X2) + Biu, (14.49) 

(^ y-Bfe, =B[(QnXi+Gi2^2). 

After substituting X2 ~ — Q22'22i-'^i from (14.48) into (14.49), assuming that 
222' exists, the reduced system will take the final form in energy coordinates 

fii = (Ju - Rn)iQu - Qi2Q22Q2i)xi + Biu, 
1 y = BliQu-Qi2Q22Q2i)xu 

which is again a port-Hamiltonian system produced by the effort-constraint 
method. 



14.6.2 An Alternative Reduction Method 

Another structure-preserving way of model reduction of port-Hamiltonian systems 
assumes that we balance the system (14.1) and approximate the lower part of the 
state vector, but now in energy coordinates, plus its dynamics. Using the notation 
F :— J — R we obtain 
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X2 = {FnQu + F22Q2i)xi + B2U - 0, ^ ' ' 

with the reduced port-Hamiltonian system of the form 

xi = {FnQn+Fi2Q2i)xi+Biu, n4';9^ 

y^iB\Qu+BlQ2i)xi. ^^^'^^^ 

From (14.51) it immediately follows that Qii-^i ~ ~^22'^2iQii-'^i —F22B2U, 
assuming that F22 exists. Substituting in (14.52) yields 

j ii = (Fii — F12F22 F2i)QiiXi + {Bi — F12F22 B2)u, ,^. c-n\ 

{y^iBj- BlF2^F2,)Qnx, - {BIF2^B2)u, ^ '^'^ 

which is if {F12F22) = f22'^2i again a reduced system in the port-Hamiltonian 
form. 

Remark 7 The reduced order port-Hamiltonian systems (14.50) and (14.53) are 
automatically passive since the preservation of the port-Hamiltonian structure 
implies the preservation of the passivity property (see [12]). 

Remark 8 Although the approximation method with the reduced model (14.53) is 
similar to the well-known balanced truncation (x2 « 0, see e.g. [1] and the ref- 
erences therein) which gives the reduced order model of the form 

ii = AijXi +BiM, (14 54) 

ytt — Cixi , 

and less well-known singular perturbation method (x2 « 0, see [2, 5]) with the 
reduced order model 



ii = (All ~AnA2^A2i)xi + (fi, ~Ai2A22B2)u, 

ysp = (Ci - C2A22A2i)xi - C2A22B2U, 



l..,^., ^..-1..,. (14.55) 



we want to underline that it is different from these reduction methods since it is 
easy to show that neither of them preserves the port-Hamiltonian structure. 



14.7 Example 

We consider a ladder network, similar to that of [15]. In our case we take 
the current / as the input and the voltage of the first capacitor Uc^ as the port- 
Hamiltonian output. The state variables are as follows: xi is the charge qi over Ci, 
X2 is the flux </»i over Li, X3 is the charge (72 over C2, x\ is the flux 02 over L2, etc. 
In our case, as in [15], the resulting Hankel singular values obtained after 
balancing are not distinct enough. In order to overcome this difficulty we inject 
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Fig. 14.2 Ladder network 

additional dissipative elements, in this case resistors R], . . .,/?„, into the model as 
shown in Fig. 14.2. 

We take unit values of the capacitors C, and inductors L,, while Rq — 0.2, /?, = 
0.2, (■ = 1, . . .,n/2,R„/2+i = 0.4. A minimal realization of this port-Hamiltonian 
ladder network for the order « = 6 is 
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Adding another LC pair to the network (with appropriate resistors), which 
would correspond to an increase of the dimension of the model by two, will 
modify the ABC-model in the following way. The sub-diagonal of the matrix A 
will contain additionally L^K-iiC^L. The super diagonal of A will contain 
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Table 14.1 Decreasing Hankel singular values of the full order system 



Order 

Hankel singular values 


1 
0.8019 


2 
0.3746 


3 
0.2766 


4 
0.2172 


5 
0.2150 


6 
0.1537 


Order 

Hankel singular values 


7 
0.1497 


8 
0.0966 


9 

0.0727 


10 
0.0715 


11 
0.0125 


12 

0.0123 



Fig. 14.3 Frequency 
response 



10 



Original model vs. Reduced order (by 
Balanced Truncation method) model 
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Furthermore, the main diagonal of A will have 



L„n-^ 



■ii/2+l 



in the 
in the 



(m — 2, « — 2) position, zero in the [n— 1 , n — 1) position and — 

'•I ^ 

(«, n) position. 

We considered the 12-dimensional full order minimal port-Hamiltonian 
network and reduce it to a 5-dimensional one by the effort-constraint method 
considered in the previous section and the usual balanced truncation method. The 
non-minimal system can be first reduced to a minimal one as shown in Sect. 14.3. 
The Hankel singular values of the full order system in decreasing order are shown 
in Table 14.1. 

It is a well-known fact that the transfer functions of reduced order models 
obtained by the balanced truncation method approximate the full order transfer 
functions well in the high-frequency region and not that well in the low-frequency 
one (of course, depending on the application considered). Since the effort-constraint 
method is similar to the balanced truncation method (with the above explained 
modification in order to preserve the port-Hamiltonian structure and passivity) we 
expected approximations of similar nature. In Figs. 14.3 and 14.4 the frequency 
response of the full order model is shown vs. the frequency responses of the reduced 
order models, obtained by the balanced truncation method and the effort-constraint 
method respectively. The figures show that the reduced order transfer functions 
indeed behave in a similar way. 
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Fig. 14.4 Frequency 
response 
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Table 14.2 ;/e^- and ;/e2- 

norms for the error systems 


Reduced order 


system by 


,5foc,-norm 


.3f2-nonTi 


Balanced truncation method 


0.3411 


0.1541 




Effort-constraint method 


1.1999 


0.4491 



In Table 14.2 Jf'oc- and ,y^2-norms are shown for the error systems obtained 
after balanced truncation reduction and effort-constraint reduction. It follows that 
the error-norms for the effort-constraint method are larger than those for the 
balanced truncation method. 

Port-Hamiltonian systems are somewhere in between general passive systems 
and electrical circuits. We believe that whenever a transfer function of a full order 
electrical circuit is approximated well by the balanced truncation method, we can 
always apply the (port-Hamiltonian) structure preserving effort-constraint method 
in order to obtain an approximation of a similar quality as the approximation 
obtained by the balanced truncation method along with the preservation of the 
port-Hamiltonian structure and passivity. 

Important questions concerning general error bounds for the structure pre- 
serving port-Hamiltonian model reduction methods and about the physical reali- 
zation of the obtained port-Hamiltonian reduced order models are currently under 
investigation. 



14.8 Conclusions 



We have shown in Sect. 14.3 that a full order uncontrollable/unobservable port- 
Hamiltonian system can be reduced to a controllable/observable system, which is 
again port-Hamiltonian, by exploiting the invariance of the reachability/unob- 
servability subspaces of the original system. We discussed energy and co-energy 
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variable representations of port-Hamiltonian systems in Sect. 14.4, illustrated by 
the example of electrical networks, wiiere the energy variables are charges and 
fluxes, while the co-energy variables are voltages and currents 

Balancing for port-Hamiltonian systems is discussed in Sect. 14.5. The effort- 
constraint method and the other alternative reduction method are introduced in 
Sect. 14.6 and applied to a general port-Hamiltonian full order system showing that 
the proposed approximations preserve the port-Hamiltonian structure for the 
reduced order systems as well as the passivity property. In Sect. 14.7 we considered 
a full order ladder network and applied the balanced truncation method and the 
effort-constraint method in order to obtain reduced order models. 

Port-Hamiltonian structure preserving model reduction methods motivate to 
investigate further important issues about error bounds between full order and 
reduced order systems, and about the physical realization of the reduced order 
systems, e.g. as an electrical circuit. 

Acknowledgements We gratefully acknowledge the helpful remarks and suggestions of the 
anonymous referees which significantly improved the presentation of this paper. 



Appendix 



Lemma 1 Consider a symmetric negative (positive) semidefinite matrix F ^ F^ 
partitioned as 



F 



Fu 

Fii 



Fn 

F22 



Then the Schur complement of F given as 

F — Fu — F12F22 F21 

can be defined even for singular F22- 

Proof First consider the case when F22 7^ 0. Since the kernel of F22 is nonempty, 
the symmetric matrix F22 takes (after a possible change of coordinates) the fol- 
lowing form 



F22 



Fh 




with F22 invertible. Then the properties of a negative (positive) semidefinite 
symmetric matrix (14.57) shown in Lemma 2 yield 



F\2 



F" 
■'^12 


0' 



, F2l = 


>2"l 
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F,: = 



Fn 
Fu 



F'{2 
F\2 



(^22) 




^2\ ^21 





■7h 
21 





77a / pa \~^ zpa T^a ( j^a \~^ zpb 
^12^-^22^ ^21 -^12^^22/ ^21 

121^22) ^21 ^I2l^22J ^21 



;i4.56) 



which is independent of z. Hence we can let e — ^ 0. This shows that the Schur 
complement can be still defined even for singular ^22- 

If F22 = then (14.57) yield F\2 — 0,F2i = and F = Fn which completes 
the proof. D 

Lemma 2 Consider a negative semidefinite symmetric matrix F ~ F^ <Q parti- 
tioned as 



F = 



Then 



F\\ F\2 

F21 F22 



ker 7^22 ^ ker F12, 
im/^21 Q imF22. 



:i4.57) 



Proof First we prove that kerF22 C kerFi2. Since F is negative semidefinite it 



follows thatx^/o- < for all real vectors x — 



. Take X2 which is in the kernel of 



F22 and xi = Fi2X2. Then for a small positive constant e we have 



[ex[ xl] 



F\i F12 

F21 F22 



£Xi 

X2 



= B^X^FiiXi + £xIF2iXi + ex'[Fi2X2 + x\F22X2 

=: e^x^FiiXi + 2e||xi|| . 



Since the term 2e||xi|| is strictly positive we can choose £ such that 2e||xi|| 
prevails over e^x\FiiXi and therefore the expression above is positive. This is a 
contradiction to the negative semidefiniteness of F. Hence, the above expression is 
nonpositive if xi = = 7^12x2 with X2 in the kernel of /^i2. 

Using the fact that the image of a matrix is orthogonal to the kernel of the 
transpose of the same matrix we write for any z which is in the image of f 21 



z G im F2 



z 1. ker Fly 
z 1. ker F12 
z ± ker 7^22 
z e imFjj — imF22. 



Therefore im ^21 '-= im F22 which proves the claim. 



D 
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Remark 9 To prove expressions (14.57) for a positive semidefinite symmetric 

-EX] 

matrix take x = 

X2 



References 

1. Antoulas, A.C.: Approximation of Large-Scale Dynamical Systems. SIAM (2005) 

2. Fernando, K., Nicholson, H.: Singular perturbational model reduction of balanced systems. 
ffiEE Trans. Automat. Contr. 27, 466^68 (1982) 

3. Glover, K.: All optimal hankel-norm approximations of linear multivariable systems and their 
L°°-error bounds. Int. J. Control 39, 1115-1193 (1984) 

4. Golo, G.: Interconnection structures in port-based modelling: tools for analysis and simulation. 
Ph.D. thesis. University of Twente (2002) 

5. Green, M., Limebeer, D.: Linear Robust Control. Prentice-Hall (1995) 

6. Hartmann, C: Balancing of dissipative Hamiltonian systems. In: Troch I., Breitenecker F. 
(eds.) Proceedings of MathMod 2009, Vienna, February 11-13, 2009, no. 35 in ARGESIM- 
Reports, pp. 1244-1255. Vienna University of Technology, Vienna (2009) 

7. Hartmann, C, Vulcanov, V.M., Schiitte, C: Balanced truncation of linear second-order 
systems: a Hamiltonian approach. Multiscale Model. Simul. (submitted 2008). Available 
from URL http://proteomics-berlin.de/28/ 

8. Moore, B.: Principal component analysis in linear systems. IEEE Trans. Automat. Control 
AC-26, 17-32 (1981) 

9. Pasumarthy, R.: On analysis and control of interconnected finite-and infinite-dimensional 
physical systems. Ph.D. thesis. University of Twente (2006) 

10. Polderman, I.W., Willems, J.C.: Introduction to Mathematical Systems Theory. Springer- 
Verlag (1998) 

11. Polyuga, R.V., van der Schaft, A.J.: Structure preserving model reduction of port-Hamiltonian 
systems. In Proceedings of the 18th International Symposium on Mathematical Theory of 
Networks and Systems, Blacksburg, July 28- August I, 2008. http://sites.google.com/site/ 
rostyslavpolyuga/publications' 

12. van der Schaft, A.J.: L2-Gain and Passivity Techniques in Nonlinear' Control. Springer- 
Verlag (2000) 

13. van der Schaft, A.J.: On balancing of passive systems. In Proceedings of the European 
Control Conference, Kos, pp. 4173-4178, 2-5 July 2007 

14. Scherpen, J.: Balancing for nonlinear systems. Ph.D. thesis. University of Twente (1994) 

15. Sorensen, D.: Passivity preserving model reduction via interpolation of spectral zeros. Systems 
Control Lett. 54, 347-360 (2004) 

16. Willems, J.C.: Dissipative dynamical systems. Arch. Ration. Mech. Analysis 45, 321-393 
(1972) 

17. Zhou, K., Doyle, J., Glover, K.: Robust and Optimal Control. Prentice-Hal (1996) 



Chapter 15 

Coupling of Numerical and Symbolic 
Techniques for Model Order Reduction 
in Circuit Design 

Oliver Schmidt, Thomas Halfmann and Patrick Lang 



Abstract In this paper, we will give a short motivation for the use of symbolic 
methods for model order reduction and present some of these techniques. Fur- 
thermore, we will discuss the hierarchy usually at hand for circuit design and 
present a workflow for the exploitation of this hierarchy. By applying it to a simple 
circuit example, we will show how model order reduction (MOR) methods can be 
profitably employed. 

Keywords Model reduction • Symbolic model reduction • Hierarchical model 
reduction • Coupled symbolic-numerical model reduction • Structure preservation • 
Structure-exploiting model reduction • Structure-preserving model reduction • 
Analog circuit • Electrical circuit 



15.1 Motivation 

First, we will give a motivation for symbolic model order reduction methods 
(MOR methods) by using the example of the folded-cascode operational amplifier 
(see also [3]) depicted in Fig. 15.1. The original design showed an instability in the 
amplifier's small-signal behavior at a frequency of approximately 10 MHz (cf. 
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Fig. 15.1 Folded-cascode operational amplifier 

Fig. 15.2, solid curves), visible by a resonance peak, which was caused by a pair of 
complex conjugate parasitic poles. 

As a classic approach, one would have performed parameter variations and 
numerical simulations in a trial-and-error fashion to solve the underlying instability 
problem. But due to the fairly high number of parameters, this was not feasible here. 
However, a symbolic analysis of the amplifier allowed a computation of its transfer 
function. Since the exact symbolic transfer function consisted of more than 5 • 10'^ 
terms, a symbolic approximation was necessary, that further allowed a derivation of 
the formula for the dominant pole pair depicted in Fig. 15.3. 

By interpretation of this formula, one detected that one of the circuit's com- 
ponents — a Gate-Source capacitance — had dominant influence with respect to the 
unstable behavior. By adding an additional capacitance, the original capacitance 
value could be modified such that the resonance could be damped (cf. frequency 
analysis in Fig. 15.2, dashed curves) and a stable behavior after a redesign of the 
amplifier was obtained (cf. transient analysis in Fig. 15.4, dotted curve). 

It should be mentioned here that in this case symbolic MOR was used to 
improve the original design of the circuit and not the reduced model itself. 



15.2 Symbolic Techniques 



What in particular does nonlinear symbolic analysis aim for? Usually we start at a 
netlist description that is translated into an equation system {differential-algebraic 
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Fig. 15.2 The amplifier's transfer function: original (solid) and with an additional capacitance 
(dashed) 



(CCO + CL) gm$MN6 a/ Cgs$MP15gm$MN6 (Cgs$MP15 (CCO + CL) ^ gin$MN6 - 4CC0 ^ CLgin$MP15 ) 



"^ 2CC0CL 2CC0Cgs$MP15CL 

Fig. 15.3 Approximated formula for the dominant pole pair of the amplifier' s transfer function 



Fig. 15.4 Stable behavior 
(dotted) after redesign of the 
originally unstable behavior 
(solid) 
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equations, DAE) via standard graph theoretical methods like modified nodal 
analysis (MNA) or sparse tableau analysis (STA) (see e.g. [13]). Based on a 
reference solution of the original system, we can symbolically reduce it to a 
simpler form with less equations, less terms, less derivatives, and so on [14]. This 
simpler form represents a behavioral model for the original system, which can also 
be translated into a hardware description language (HDL) and then simulated using 
a circuit simulator. 
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The general workflow is as follows (see [11, 14]); according to a numerical 
analysis A e {AC analysis, DC analysis, transient analysis} and an input u we 
have a reference solution vp for the output v of the original DAE system F, as well 
as a user-given error bound s and an error function E to compute the actual error. 
This reference solution Vp is given by pairs of sampling points and interpolation 
values, hence the accuracy in comparison to the exact mathematical solution 
V depends on the numerical solver's choice of the step size. The complexity of F is 
reduced by iteratively applying reduction techniques R and comparing the 
(numerical) solution Vg of the so far reduced system G to the reference solution Vp. 
As long as the error is within the given bound, the performed reduction step is 
allowed, otherwise it is rejected. 

In the following, four symbolic techniques will be explained [14]. They are 
implemented in Analog Insydes [18], an add-on for Mathematica [17]. Analog 
Insydes is developed by Fraunhofer ITWM in Kaiserslautern, Germany. 



15.2.1 Algebraic Manipulations 

The first symbolic reduction technique we want to present is a purely algebraic 
manipulation. Here, certain variables are eliminated and substituted in the remaining 
set of equations. Additionally, independent blocks of equations are removed, e.g.: 



o'4S)^^^°-^(^'^«)^^ 



Obviously, this method is mathematically exact. Despite this, we have to be careful, 
since applying this method may lead to equations including a huge number of terms. 



15.2.2 Branch Reductions 

Using branch reductions, unused branches of piecewise-defined functions are 
detected and removed. Piecewise-defined functions occur, e.g., in the device 
model equations of transistors or diodes. Usually these components only work in a 
certain operating region that is contained, e.g., in the interval [a, b], then branch 
reductions are possible: 



/(x) = ^ f2{x) a<x<b ^f{x) =f2{x) for all x. 




Here, / is to be considered as a function that is contained in at least one term in at 
least one of the equations of the DAE system F. 
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15.2.3 Term Substitutions 

Here, an equation, which typically consists of a sum of products,' is considered, 
where certain terms are replaced by appropriate constant values, e.g. adequate 
average values. This technique can also be applied to different levels, e.g. when 
some of the summands are functions and their arguments themselves contain sums, 
and so on. 

In the following example, the 7th equation Fj in the equation system F is a sum 
of terms f„ where the Ath term f^ will be replaced by the constant k: 

N N 

Fj : Y^ ti{x) = ^ G,- : ^ r/(x) + k = 0. 

This method obviously is not an exact one, since it alters the solution x of the 
original equation system. The solution of the approximated system is denoted by x. 



15.2.4 Term Reductions 

This technique is a special case of term substitutions, since certain terms are 
replaced by zero, which means that they are cancelled: 

N N 

Fj:J2t,{x)^0^Gj: ^ t,{x)=0. 

i=l i=l,ijik 

Concerning all the above techniques, we consider that as long as the error that is 
caused by applying these methods is within the user-given error bound, the cor- 
responding reductions are valid. 



15.3 Hierarchical Systems 

Within the last few years the growing miniaturization of integrated circuits and 
increasing integration density of circuit components have led to nanoelectronic 
structures. It turns out that physical effects like thermic or electro-magnetic 
interactions cannot be neglected anymore. One also observes a very fast growing 
number of parameters for the device models for a single transistor within the last 



In MNA contributions of components to the system typically are of the formA'^ ■ f(Bx), where 
A and B are incidence matrices describing the circuit topology and /is a function describing the 
behavior of the corresponding component. The same expression holds for submodels. 
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VCC 




RBIAS 



VEE 



Fig. 15.5 Differential-amplifier circuit 

years. This forces the usage of model descriptions based on partial differential 
equations (PDE) besides the netlist based models or those based on DAEs, 
especially for the semi-conductor components. Hence, one has to cope with partial 
differential-algebraic equations (PDAE), coupled systems of DAEs and PDEs. 

Moreover, the circuit usually is hierarchical, as we have various subsystems 
coupled by a topology. Due to their respective accuracies, these subsystems have 
different mathematical descriptions. By using standard graph theoretical methods 
like MNA or STA (see [13]) to build up the describing equation system, this 
hierarchy is lost. So our goal is the exploitation of the circuit's hierarchical 
structure. 

As an example, let us consider the differential-amplifier circuit depicted in 
Fig. 15.5. The voltage sources VCC and VEE denote the voltage supply for the 
amplifier circuit, whereas the voltage source VI defines the input. The output is 
defined as the voltage-potential of node 5, denoted by Vout. The three sources are 
connected to the remaining circuit components via three transmission lines, for 
which we will use a discretized PDE model of the telegraph or transmission line 
equations with 20 line segments each (see e.g. [7] or [8], Chaps. 6 and 5). 



15.3.1 Symbolic Reduction of the Entire Circuit 



If we use MNA in node voltage formulation to set up the describing equation 
system for the transient circuit behavior, we obtain 167 equations with a total 
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Fig. 15.6 Left: Solutions of the original system (solid) and the reduced system (dotted) allowing 
2% error. Right: The corresponding error plot 

number of 645 terms. For the input VI, we chose a sine-wave voltage excitation 
with a magnitude of 2 V and a frequency of 10^ Hz. The supply vohages VCC and 
VEE are 12 V and —12 V, respectively. By using the symbolic reduction tech- 
niques presented in Sect. 15.2 and permitting an error of 2% for the output Vout 
to reduce the entire system, we obtain a system consisting of 124 equations with 
425 terms in total. For the reduction, a few hours are needed,^ in which more than 
95% of that time is spent for the computation of the transient ranking'* (see e.g. 
[4, 14, 15]). Figure 15.6 shows the solution and the error of this reduced system in 
comparison to the solution of the original system. 

According to these figures, our reduced system with only about 1% error is a 
very accurate approximation. 

If we allow an error of 10% for the reduction of the original system, we only 
obtain 44 equations and a total of 284 terms. The time needed is almost the same 
because of the ranking computation mentioned above. However, the reduced 
equation complexity has been achieved at the expense of accuracy (cf. Fig. 15.7). 



15.3.2 Exploitation of the Hierarchical Structure 

What will happen, if we take an 'intuitive' hierarchy into account? Therefore 
consider certain structural patterns in the circuit's netlist: the differential pair of 
transistors (together with some resistors) in the upper box on the right side of 
Fig. 15.8, a current mirror in the lower right, three transmission lines in the 
middle, and the sources on the left. 



x% error here means j^ of the maximum magnitude of the reference solution, considered on 
the entire time interval: ^ • sup,^;,^ ,,| ||vF(r)||^. 

^ The computations were executed on a Dual Quad Xeon E5420 with 2.5 MHz and 16 GB 
RAM. 

A ranking is a trade-off between accuracy and efficiency in computation time that estimates the 
influence of a term in the equation system on the system's solution. 
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Fig. 15.7 Left: Solutions of the original system (solid) and the reduced system (dotted) allowing 
10% error. Right: The corresponding error plot 




Fig. 15.8 Differential-amplifier circuit with its 'intuitive' hierarchy 

As said before, by using e.g. MNA to set up the describing equation system for 
the entire circuit, the hierarchy information is lost, since the system contains 
equations with mixed parts from different subsystems. But if we consider the five 
subsystems one by one, we can transmit the hierarchy information into the 
describing set of equations and reduce them separately. 

For the reduction of the three transmission lines, we translate the describing 
linear equation system into its state space form and apply Arnoldi's algorithm (see 
e.g. [5] or [1], Chaps. 10 and 11), a numerical approximation method based on 
projections on an appropriate Krylov space. For both the transmission lines that are 
connected to the voltage supplies VCC and VEE we iterate only one step, for the 



One step provides sufficient accuracy here, since the sources are DC voltage sources. 
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Fig. 15.9 Left: Solutions of the hierarchically reduced system {dotted) and the original system 
(solid) by applying the 3-step Amoldi iteration to the input voltage transmission line. Right: The 
corresponding error plot 

one connected to the input voltage VI we perform three steps in the Arnoldi 
iteration.*' Due to two additional equations and variables for internal port model- 
ing, we thus can reduce the transmission line subsystems from 50 down to 8 resp. 4 
equations. 

The current mirror and the differential pair subsystems are reduced symboli- 
cally by using the presented techniques from Sect. 15.2. An exact description of 
how this is done is given in Sect. 15.4. In case of the current mirror subsystem, the 
symbolic simplification yields 9 instead of 16 equations with a total number of 20 
instead of 59 terms. For the subsystem with the differential pair, we obtain a 
reduced system with 13 instead of 22 equations and 50 instead of 91 terms in total. 
For both of these reductions a 2% error bound^ is permitted, but we observe almost 
no further reductions, if we allow 10% error instead. Note that all of the reductions 
mentioned above are computed within seconds. How these reduction steps — 
particularly in the symbolic case — can be carried out will be explained in the next 
section. 

If we now plug together all the reduced subsystems to obtain a reduced form of 
the entire differential-amplifier circuit, the results are very accurate as one can see 
in Fig. 15.9. Instead of 167 equations and a total number of 645 terms, we only 
have 62 equations with 252 terms altogether. The error of the entire system 
compared to the original one is approximately 8%. 

A higher dimensional projection space obtained by the Arnoldi iteration pro- 
vides more accurate solutions, but of course also leads to a larger system. If, for 
example, we perform five iteration steps in the Arnoldi algorithm to reduce the 
transmission line that is connected to the input voltage VI instead of only three, we 
can further improve our results and obtain a maximum error of the entire reduced 



We performed some experiments with a higher number of iteration steps, but the gain of 
additional accuracy in comparison to the size of the larger system was rather mai'ginal for a 
number of iteration steps bigger than five. 

As we are reducing only the single subsystem, this error bound is not limiting the error of the 
entire circuit's output Vout to the given values (see Sect. 15.4 and Algorithm 1 for further 
details). 
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Fig. 15.10 Left: Solutions of the hierarchically reduced system (dotted) and the original system 
(solid) by applying the 5-step Amoldi iteration to the input voltage transmission line. Right: The 
corresponding error plot 



system of approximately 2% (Fig. 15.10). But then the reduced entire system 
consists of 66 equations and a total number of 396 terms. 



15.3.3 Using the Reduced Model with Other Inputs 

To check whether the reduced model obtained with our hierarchical approach 
works fine also for other inputs, we apply a pulse wave with a magnitude of 2 V, a 
sum of three pulses with magnitudes of 1, 2, and 3 V, respectively, and a sum of 
three sine functions 

f{t) := 2 ■ sm{2n • 10^ • r) + 2 • sin(27i • 2 • 10^ • r) + 1 • sin(27i • 5 • 10^ • f) 

to VI. The graphs of these functions are shown in Fig. 15.11 on the time interval 
/= [Os,10"^ s]. 
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Fig. 15.11 Three other inputs VI to test the reduced model of the differential-amplifier circuit 
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Fig. 15.12 Left: Simulation results of the original (solid) and the hierarchically reduced model 
(dotted), together with the corresponding input VI (dashed). Right: The corresponding error plots 

With these inputs, both the original differential-amphfier and our hierarchically 
reduced model^ are simulated. The graphs of the simulations as well as the cor- 
responding error plots are shown in Fig. 15.12. 

According to these figures, the obtained model works quite well in all the test 
cases and therefore can be used as a behavioral model of the original differential 
amplifier in a certain domain. 



15.3.4 Comparison of both Approaches 

The results of the two different reduction approaches for the differential-amplifier 
circuit are listed in Table 15.1. 



We chose the more accurate model with 66 equations, see Fig. 15.10. 
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Table 15.1 Results of the two different reduction approaches for the diflferential-amphfier circuit 
Orig. system Reduction of entire circuit Reduction exploiting hierarchy 



Equations 


167 


124 


44 


66 


62 


Terms 


645 


425 


284 


396 


252 


(Permitted) error 




2% 


10% 


~2% 


-8% 


Amoldi iterations 








(5/1) 


(3/1) 


Computation time 




Few hours 


Few hours 


Within seconds 


Within seconds 



Comparing both approaches, we conclude that the one that exploits the hier- 
archical structure is the better choice, since it delivers much better results in less 
computation time; we obtained reduced systems with almost the same accuracy in 
the 2% error case, but only about half the number of equations. Performing only 
three steps in the Arnoldi iteration to reduce the transmission line connected to the 
input voltage VI, we still received a reduced overall system that fit the original 
(reference) solution quite well. On the other hand, the reduction of the whole 
circuit permitting an error of 10% led to a system that fully exploits the error 
bound. 



15.4 Workflow for the Exploitation of the Hierarchy 

Now we come to the workflow for the reduction of the particularly symbolic 
subsystems of the entire circuit. It is schematically shown in Fig. 15.13. 

From the hierarchical segmentation, we have a set of interacting subsystems. 
Since we always need a reference solution to symbolically reduce a system, we 
need a 'closed circuit'. If we simply cut out a subsystem from its connecting 
structure, we will have no defined input/output behavior at its ports, i.e. we have 
no information about its current-voltage relation. However, the following algo- 
rithm provides a possibility for reducing such a subsystem (see also Fig. 15.13). 



sub- 
sys. 



sub- 
system- 



sub-' 
sys-- 



-^ 



-^ 



1^ 

c 




Fig. 15.13 Illustration of the workflow for the symbolic reduction of circuit subsystems 
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Algorithm 1 (Reduction of subsystems) 

(a) connect the subsystem to a test bench'' and record the vohages'" at its ports 

(b) remove the test bench and plug vohage sources to the subsystem's ports that 
deliver exactly the recorded vohages (as a data-based function) thus having the 
subsystems isolated 

(c) now the closed circuit can be reduced ' with appropriate (numerical or) 
symbolic methods presented in Sect. 15.2 

(d) remove all the above sources after the reduction of the subsystem and finally 
obtain a reduced subsystem that serves as a behavioral model 

This workflow has been applied to the subsystems of the differential-amplifier 
example and delivered the results described at the end of the previous section. 

It should be mentioned here that this approach only controls the errors at the 
ports of the single subsystems. Thus, one cannot pose a statement about or 
guarantee a certain global error, i.e. the error on the output of the entire (reduced) 
circuit when replacing the original subsystems by the reduced ones. This could 
possibly be a point of future investigation (see also Sect. 15.6). 



15.5 Comparison to Other Approaches 

In [9], LTI coupled control systems are considered, which are coupled by input/ 
output-relations. Moment matching approximation and balanced truncation, two 
numerical MOR methods, are used to reduce the order of the closed-loop systems. 
Moreover, structure-preserving model reduction approaches using balanced trun- 
cation or Krylov subspaces are presented and some error bounds are given. The 
internal inputs of the LTI subsystems are given by a linear combination of the 
other subsystem's internal outputs and the external input, the external output is a 
linear combination of all internal outputs of the subsystems. 

In contrast, in this paper we are dealing with nonlinear hierarchical systems 
which are much less studied, but of great importance. We do not consider an 
approach that is based on inputs and outputs for the coupling of subsystems as 
adequate to model an electronic circuit, since the subsystems are mutually inter- 
acting with each other. Hence, we did not try to extend these approaches. 

In [16], however, the author uses "variable sharing" and relations on system 
variables for modeling the behavior of the variables on the subsystems' external 
terminals to capture physical phenomena. This approach is applicable not only to 



^ A test bench is a simulation test environment. Here, one could take all the rest of the entire 

circuit as test bench. 

'" Of course, one could record the currents as well. 

By providing the voltages at all k ports, they work as the input to the subsystem. Then the 
currents flowing into the subsystem at the k ports are defined as the output and will be controlled 
by the specified error bound. 
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electronic circuits, but also to other systems. For example, in a hydraulic system, if 
two pipes are connected with each other, their pressure variables at the link are 
equalized while the sum of their flow variables is zero. ' Thus the variables at the 
link are shared. Also in our opinion, this approach is appropriate to capture all 
relevant physical effects of the interacting subsystems, while viewing intercon- 
nections as output-to-input assignments introduces a signal transmission mecha- 
nism that is not part of the physics of an electronic circuit. 

In our paper, we use a similar approach and, in addition, provide a sophisticated 
model reduction method for hierarchically structured nonlinear systems. 

Of course there already exist some methods like POD for nonlinear model order 
reduction. However, we could not find any other approaches for nonlinear model 
order reduction techniques that explicitly exploit the hierarchy of the overall 
system. We found some other structure-preserving approaches [2, 6, 12], but these 
were applied only to linear systems. They are based on Krylov space approxi- 
mations with a certain shape of the projecting matrices to preserve the structure of 
the original system in its reduced equivalent. 



15.6 Summary and Future Work 

In this paper, we considered two different approaches for the reduction of a dif- 
ferential-amplifier circuit. The first one reduced the entire circuit using methods as 
presented in Sect. 15.2 (cf. also [3, 14]). The second approach, introduced in this 
paper, exploits the circuit's hierarchy. Here we reduce the subsystems separately, 
then plug the reduced ones together, and thus obtain a reduced overall system. 

The new hierarchy-exploiting approach turned out to be the better choice, since 
we obtained much better results with respect to computation time. Moreover, the 
accuracy was almost the same, but the number of equations could be halved. 

The principal idea is to reduce the subsystems separately (either symbolically or 
numerically), each one allowing a certain approximation error e^.. If we plug together 
the reduced subsystems, we obtain an approximation for the entire system with a 
certain error e compared to the original overall system. If the total error £ or the degree 
of reduction appears not to be satisfactory, we restart the reduction of at least one 
appropriate subsystem with an altered error bound £^ More details can be found in [ 1 0] . 

One goal of possible future investigation could be finding a functional relation 
between the errors s^ and the error e of the entire system. The challenge, probably 
realistic only for special systems, would be obtaining an estimate for the sub- 
system errors as function of the user-specified error e. 
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The respective flows are directed towards the interior of each pipe. 
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Chapter 16 

On Stability, Passivity and Reciprocity 

Preservation of ESVDMOR 



Peter Benner and Andre Schneider 



Abstract The reduction of parasitic linear subcircuits is one of many issues in 
model order reduction (MOR) for VLSI design. This issue is well explored, but 
recently the incorporation of subcircuits from different modelling sources into the 
circuit model has led to new structural aspects: so far, the number of elements in 
the subcircuits was significantly larger than the number of connections to the 
whole circuit, the so called pins or terminals. This assumption is no longer valid in 
all cases such that the simulation of these circuits or rather the reduction of the 
model requires new methods. In Feldmann and Liu (ICCAD '04: Proceedings of 
the 2004 IEEE/ACM International Conference on Computer-Aided Design, 88-92, 
2004), Liu et al. (ICCAD '05: Proceedings of the 2005 IEEE/ACM International 
Conference on Computer-Aided Design, 821-826, 2004; Integr. VLSI J. 41(2): 
210-218, 2008) the extended singular value decomposition based model order 
reduction (ESVDMOR) algorithm is introduced as a way to handle this kind of 
circuits with a massive number of terminals. Unfortunately, the ESVDMOR 
approach has some drawbacks because it uses the SVD for matrix factorizations. In 
Benner (Mathematics in Industry 14, 2009), Schneider (Matrix decomposition 
based approaches for model order reduction of linear systems with a large number 
of terminals, 2008) the truncated SVD (TS VD) as an alternative to the SVD within 
the ESVDMOR is introduced. In this paper we show that ESVDMOR as well as 
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the modified approach is stability, passivity, and reciprocity preserving under 
reasonable assumptions. 

Keywords Model reduction • Model order reduction • Many terminals • Large 
number of inputs/outputs • Passivity • Stability • Reciprocity • Power grids 



16.1 Introduction 

MOR has become an important tool in the preprocessing of circuit simulation over 
the last decades. The original model which results from the mathematical mod- 
eling with the help of, e.g., modified nodal analysis [11] has to be simplified due to 
its complexity. One special part of this simplification for VLSI design is the MOR 
of pEirasitic linear interconnect circuits [7]. These circuits appear in form of sub- 
structures in the design of integrated circuits (ICs). They contain linear elements 
which do not necessarily have large influence on the result of the simulation. 

There are applications in which the structure of these parasitic linear subcircuits 
has been changing recently in the following sense. So far, the number of elements 
in these interconnect circuits was significantly larger than the number of con- 
nections to the whole circuit, the so-called pins or terminals. This assumption is 
not true anymore in many cases. Power grid networks which supply elements of 
large circuits with energy are of this special form [22, 24]. Often these power grids 
are realized as an extra layer of elements in between the layers of transistors. The 
fact that they need connections to the elements for power supply explains the high 
number of pins. The same problem appears in connection with synchronization. In 
clock distribution networks, the clock signal is distributed from a common point to 
all the elements that need it for synchronization [14]. For simulating these special 
subcircuits, new methods are needed. In some applications, a lot of terminal sig- 
nals behave similarly so that it is possible to compress the input-/output -matrices 
of the transfer function in such a way that the I/O behavior can be realized through 
a few so-called virtual inputs/outputs. As a consequence we deal with these virtual 
terminals, the number of which is much less than the original number of terminals. 
This allows the use of well known MOR methods like balanced truncation or 
Krylov subspace methods to reduce the number of inner nodes. The basic work 
was done a few years ago with the introduction of the (E)SVDMOR approach 
[6, 15-17, 23]. In this paper we review the existing ESVDMOR approach [6, 17] 
and discuss stability and passivity of the computed reduced-order model. As most 
linear subcircuits with a massive number of pins represent RC(L) circuits and 
contain no active devices, they are modelled as passive and thus stable systems. 
Therefore, it is generally important to compute reduced-order models that share 
these properties with the original model. 

In the following section, we review the fundamentals of the ESVDMOR 
approach. We introduce the moments of a transfer function of the circuit 
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describing system and show how to use the information in these moments in order 
to reduce the number of terminals. In this way we achieve a very compact model. 
The preservation of stability and passivity in the reduced-order models is the topic 
of Sect. 16.3. We introduce basic definitions and prove that the ESVDMOR 
approach is stability, passivity, and reciprocity preserving under certain conditions. 
A numerical example shows the passivity preservation. In Sect. 16.4, we sum up 
the results and outline future research activities. 



16.2 The Extended SVDMOR Approach 



We consider a linear time-invariant continuous-time descriptor system, which can 
be represented as 



Cx{t) = -Gx{t) + Bu{t), x(0) = xo, 
y{t)^Lx{t), 



(16.1) 



with C,Ge R"""", B e M"^'«i", L e R'"™'^", x{t) e M" containing internal state 
variables, u{t) e M"''° the vector of input variables, e.g., the terminal currents, 
y{t) <E M™™' being the output vector, e.g., the terminal voltages, xq e M" the initial 
value and n the number of state variables, called the order of the system. The 
original linear system (16.1) to be reduced has the following transfer function in 
frequency domain: 



H{s) =L{sC+GY^B. 



(16.2) 



The number of inputs m„^ and the number of outputs mout are not necessarily 
equal. Further on we can define the /th block moment of (16.2) as 



mi = L{-G-^C)'G-^B, 
in terms of mi as an Wout x m-m matrix 



0,1,... 



(16.3) 



mi 



'"1,1 



■'1,2 

4.2 



'l-niin 



(16.4) 



Note that the moments in (16.3) are equal to the coefficients of the Taylor series 
expansion of (16.2) about s — Q. The expansion about s — sq ^ Q leads \.o fre- 
quency shifted moments defined as 



mi{sQ) ^ L{~{sqC + G) 'C)'(5oC+G) 'fi, / = 0,1,. 



(16.5) 



In the ESVDMOR method [6, 17], the information of a combination of these 
moments to create a decomposition of (16.2) is used in the following way. For 
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being able to allow terminal reduction for inputs and outputs separately, w.l.o.g. 
we use r different block moments forming two moment matrices, the input 
response matrix M/ and the output response matrix Mq, as follows: 



M, 



mo 
mi 

nir 1 



Mr, 



mo' 



mr i' 



(16.6) 



Note that it is also possible to use different numbers of block moments to create 
Mi and Mq and that column k of Mj represents the coefficients (moments) of the 
series expansion of (16.2) at all outputs corresponding to input k. Similarly, each 
column k of Mq represents the coefficients of output k corresponding to all inputs. 
We assume the number of rows in each matrix to be larger than the number of 
columns. If not, r has to be increased. 

Applying the SVD to these matrices, we can obtain a low rank approximation 



Mi = Ui^iVj « [//,., E,,., Vl , Mo = Uo^oVl « Uq., ^o,^ Vq,, > 



(16.7) 



t//..,t/; 






is an r • Wout x r ■ OTqui orthogonal matrix, 
is an min x m^^ orthogonal matrix, 

is an r • mi„ x r ■ mi^ orthogonal matrix, 
is an wiout x OTqui orthogonal matrix. 



where 

• Vi = 

• V/ = 

• Vo = 

• Vo = 

• E/ and Zo are r ■ wiout x m\^ and r ■ m-^^ x Wout diagonal matrices, 
whereas 

• S/, and Eo, are r, x r,- and To x ¥„ diagonal matrices, 

• V/,. and Vo,. are mi,, x r,- and Wout x rg isometric matrices that contain the 
dominant column subspaces of M/ and Mq, 

• f//, and Uq,. are rmout x r, and rmi,, x r,, isometric matrices that are not used 
any further, r,- < «{„ and Vo < n^out are the numbers of significant singular values 
as well as the numbers of the reduced virtual input and output terminals. Due to 
the fact that the important information about the dependencies of the I/O-ports 
is hidden in the matrices Vj and Vq , approximations of B and L using the 
factors in (16.7) lead to 



BKiBrVf and L^Vo,.,Lr. 



(16.8) 



Here, B^ G M."^''' and L,. G W"^" are computed by applying the Moore-Penrose 
pseudoinverse [18] (denoted by (•)^) of Vf and of Vq,. to B and L, respectively. In 
detail, that means 
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B, = BV/,., {Vl 7/,., )"' = BVl+ = BV]^, (16.9) 

and 

Lr = (y^,._^yoJ"' v^^._^L = y+_^L = y^__^L, (I6.10) 

since V/,. and Vo^ are isometric. As a consequence we get a new internal transfer 
function //,-(■?) by using the approximation 

i/(s) « //(.s) = Vo Lr{G + sCy^BrVl . (16.11) 

"" V ' '' 

:=H,(.) 

This terminal reduced transfer function H^is) can be further reduced to 

H^{s) =l,{G + sCy^Brf^Hr{s) =L,.{G + sCy^B,. (16.12) 

by some established MOR method, e.g., balanced truncation or a Krylov subspace 
method, see [2, 4, 20] for introductions to linear model reduction techniques. We 
end up with a very compact terminal reduced and reduced-order model //,(.?), i.e. 

H{s) « H{s) « H,is) = Vo,M')Vl . (16.13) 

Note that the well-known SVDMOR approach [6] can be considered as a 
special case of ESVDMOR, using only one moment and one SVD, e.g. r = 1, and 
using mo as moment 

The ESVDMOR approach as described above is not appropriate for really large- 
scale circuit systems as it employs the full SVD of the moment matrices. Observing that 
the computation of the reduced-order model only needs the leading block-columns 
[//, , V7, , Uo, , Vo, of the orthogonal matrices computed within the SVD, the 
expensive SVD can be replaced by a truncated SVD which can be computed cheaply 
employing sparsity of the involved matrices using Krylov subspace or Jacobi- 
Davidson methods [12, 13]. Efficient algorithms based on the first approach are sug- 
gested in [5, 21] while the Jacobi-Davidson version is under current investigation. 

16.3 Stability, Passivity, and Reciprocity 

In this section we establish some facts on the preservation of stability, passivity, 
and reciprocity in ESVDMOR reduced-order models. 

16.3.1 Stability 

In order to discuss stability of descriptor systems we need the following definition 
and lemma which can be found, e.g., in [4]. 
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Definition 1 The descriptor system (16.1) is called asymptotically stable if 
\wci,^ryox{t) — Q for all solutions x{t) of Cx{t) = ~Gx{t). 

Lemma 1 Consider a descriptor system (16.1) with a regular matrix pencil 
aC + G. The following statements are equivalent: 

1. System (16.1) is asymptotically stable. 

2. All finite eigenvalues of the pencil kC + G lie in the open left half-plane. 

Using the results from Lemma 1 we are able to formulate the following 
theorem: 

Theorem 1 Consider an asymptotically stable system{\6.\) with its transfer 
function (16.2). The ESVDMOR reduced-order system corresponding to (16.13) is 
asymptotically stable iff the inner reduction (16.12) is stability preserving. 

Proof It is obvious that none of the approximations (16.8), (16.11) and (16.13) 
change the eigenvalues of IC + G. With Lemma 1 and the assumption that 
(16.12) is stability preserving it directly follows that the ESVDMOR approach is 
stability preserving. D 

A possible stability preserving model reduction method that can be applied along 
the lines of Theorem 1 is balanced truncation for regular descriptor systems, see [3, 4]. 



16.3.2 Passivity 

Showing that the ESVDMOR approach is passivity preserving turns out to be more 
difficult. First we note that a system is passive iff its transfer function is positive 
real [1]. The following definition of positive realness can be found, e.g., in [8]. 

Definition 2 The transfer function (16.2) is positive real iff the following three 
assumptions hold: 

1. H{s) has no poles in C+ = {s e |Re.v > 0}, i.e. the system is stable if addi- 
tionally there are no multiple poles on iM., 

2. H{s) = H{s) for all s e C, 

3. Re{x"H{s)x) > for all s e C+ and x e C". 

For passive systems we have to assume that the number of inputs is equal to the 
number of outputs: m^^ = m^M — m. Due to the fact that we often deal just with 
parasitic linear RLC circuits we furthermore assume L — B^ such that 

H{s) = B^{sC + GY^B. (16.14) 

As a result of Modified Nodal Analysis (MNA) modeling the system has a well 
defined block structure [8] and we get 



16 On Stability, Passivity and Reciprocity Preservation of ESVDMOR 

G, 



283 



Ci 


■ 


i: + 





Ci. 





'Gl 



Gi 


X ~ 


'B{ 











y={B, 0] 



Ok 



^16.15) 



where G\, C\, C2 are symmetric, Gi, Ci > (i.e., both matrices are positive semi- 
definite), and C2 > 0, i.e., C2 is positive definite. Moreover, as before, the matrix 
pencil }.C + G is assumed to be regular. It is easy to see that under these 
assumptions, H{s) is positive real and thus the system is passive [10]. 

Theorem 2 Consider a passive system of the form (16.15). The ESVDMOR 
reduced system (16.13) is passive iff the inner reduction (16.12) is passivity 
preserving. 

Proof If we can show that Hr{s) in (16.13) is positive real, we have shown that 
the reduced system is passive, see Definition 2. The moments of (16.15) are 

ini(.jo) =fi^(-(^oC+G)"'C)'(ioC+G)"'B, (16.16) 

with < io e M and det(ioC + G) ^ 0. 

in [9] we define 



J 

and /' 



Following a technique which can be found, e.g.. 
/ 







The properties of G], Ci, and C2 as well as the fact that J — J 
I lead to the following rules: 



Rl: J = r\ 

R2: JC = CJ, hence JCJ = C, 

R3: {sqC + Gf = sqC + JGJ = s^JCJ + JGJ, hence {sqC + G)'^ ■ 



{sqJCJ- 



JGjy =J-\sqC+GY'J 



-if-i 



J{sqC + G) '/, 



R4: B = JB, and 

R5: for every matrix X and Y, {~X-^Y)' = X-\Y{-Xy^)'X holds. 

A straightforward calculation employing these rules shows that 

mi^(^o) = {B''{~{soC+G)-'C)'{soC + G)-'bY 

= B^soC + g)-^{{~{soC + cr'ayVB 

^=^B^soC + G)-^{{soC + G)-\C{-{soC + G)-')Y{soC+G)yB 
- B^soC + G)-^{soC + Gf{{C{-{soC + G)-'))'Y{soC + GY^B 
= 5^{(C(-(.oC+ G)-'))'}^(.„C + G)-^B 



(c=c 



= ' B^((-(.5oC + G))-' C)\soC + G)-'B 
^^^B^{~J{soC + Gy'jC)'{J{soC+Gy'j)B 
=' B^J{soC + Gy\jC{~J{soC + G)"' ))'(.soC + G)J{J{soC + GY^J)B 
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'•'''^^'' B'^JisoC + G)-\C{~{soC + G)''))'{soC + G){soC + G)-'JB 

^ b'^j{soC + Gy\c{-{sQC + Gy^)yjB 

'=' B^isoC + G)-\C{-{soC + G)-'))'fi 

*^' B^{-{soC + G)-'cy{soC + G)-'b 
= mi{so). 



It follows from (16.6) that M/ = Mq, such that 



yl = yL = y' 



(16.17) 



(16.18) 



with the help of (16.7). It is easy to see that 

h{s) = v,.bJ.{g + sC)-'b,vJ, 

with Br analogously to (16.9). Hence, if the model reduction method used in 
(16.12) leads to a positive real transfer function of the reduced-order model, 
passivity is preserved. D 

Remark 1 As a simple numerical example we show a RC circuit called rc549, 
which is also investigated in [21] and the Section by Antoulas/Lefteriu in this 
booli. It is provided by the Infinoen Technologies AG, Munich, Germany. The 
order of the corresponding descriptor system is n — 141, the number of terminals 
is m = 70. Due to simplicity we use H^c = 'Mo(>J = 0) as basis for the matrices in 
(16.6) and we abstain from the reduction step in (16.12). Following Definition 2, 
we show that the Hermitian part of the transfer function on the imaginary axis is 
positive semi-definite, i.e., Hh = Hijco) + H(Ja))* >0. Figure 16.1 shows the 
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Fig. 16.1 Smallest eigenvalues of the hermitian part of the transfer function of rc549 depending 
on the frequency o) 
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Fig. 16.2 Relative difference correlated to the largest eigenvalue of the original transfer function 

smallest eigenvalue of Hh of the original transfer function and of the terminal 
reduced transfer functions H^is). Although the frequency range is much larger than 
important in applications, this smallest eigenvalue grows depending on how much 
terminals are reduced. The relative difference of these smallest eigenvalues of the 
reduced systems to those of the original system is shown in Fig. 16.2. We rec- 
ognize that this difference depends on the spectrum of the eigenvalues of H(s) but 
does not play a too important role in the aspect of passivity preserving. Note, that 
we add numerically zero eigenvalues in (16.13) which do not destroy the positive 
semi-definiteness of the transfer function. 



16.3.3 Reciprocity 



Another important property of MOR methods is reciprocity preserving, which is a 
requirement for synthesis of the reduced order model as circuit. We assume the 
same setting as in Sect. 16.3.2. An appropriate definition can be found, e.g., in [19]. 

Definition 3 A transfer function (16.14) is reciprocal if there exists mi,m2 G N 
with mi+m2=m, such that for E^, = diag(/m, , — /m,) and all jSC where 
H{s) has no pole holds 

The matrix Z^, is called external signature of the system. A descriptor system is 
called reciprocal if its transfer function is reciprocal. 



As a consequence, a transfer function of a reciprocal system has the form 

Hn{s) Hu{s) 
~Hl,{s) H22{s) 



H{s) 



(16.19) 
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where //„(.?) = Hf^is) e R'"'''"' and Hiiis) = H^2i^) e M."'-^'"-, which means that 
the transfer function is some kind of symmetric. 

Theorem 3 Consider a reciprocal system of the form (16.15). The ESVDMOR 
reduced system (16.13) is reciprocal iff the inner reduction (16.12) is reciprocity 
preserving. 

Proof Due to the reciprocity of the original system, the corresponding transfer 
function (16.14) has the structure given in (16.19). Equation 16.18 shows that none 
of the steps in ESVDMOR destroy this symmetric structure if (16.12) preserves 
reciprocity. D 



16.4 Remarks and Outlook 

We have reviewed the ideas of ESVDMOR and that this approach is stability, 
passivity, and reciprocity preserving under reasonable assumptions described in 
Sect. 16.3.2. As a direction of future research, it would be desirable to have a proof 
of passivity preservation for more general settings, e.g., a more general structure 
than in (16.15). 

In [5] it is pointed out that an explicit computation of the moments in (16.3) 
would be numerically unstable and too expensive. In order to avoid this problem, a 
numerical solution by using the truncated SVD based on the implicitly restarted 
Arnoldi method is presented there. As an alternative, we also investigate the 
Jacobi-Davidson approach (JDSVD) [12] for computing the TSVD. In this con- 
text, the computation of the residual and the solution of the correction equation 
within IDSVD are based on iterative methods. This offers a large potential for 
increased efficiency in ESVDMOR due to the usual structures of circuit equations 
that can be exploited here. 

An automatic error control which regulates the number of significant singular 
values as well as the best choice of the decomposition method and the number of 
used moments are other interesting topics for future research. 
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Chapter 17 

Model Order Reduction of Nonlinear 
Systems in Circuit Simulation: Status 
and Applications 

Michael Striebel and Joost Rommes 



Abstract In this paper we review the status of existing techniques for nonlinear 
model order reduction by investigating how well these techniques perform for 
typical industrial needs. In particular the Trajectory Piecewise Linear-method and 
the Proper Orthogonal Decomposion approach are taken under consideration. 

Keywords Model order reduction • Nonlinear systems • Proper orthogonal 
decomposition • Piece-wise linearisation 



17.1 Introduction 

The dynamics of electrical circuits can generally be described by a nonlinear, first 
order, differential-algebraic equation (DAE) of the form: 

|q(x(0) + j(xW) +Bu(r) = 0; y(f) = L^x(f), (17.1) 

where x(f) £ W represents the unknown vector of circuit variables at time f € R; 
q, j : K." -^ R" describe the contribution of reactive and nonreactive elements, 
respectively; B e R'""" distributes the input excitation u : R ^ R'" and L e R""^ 
maps the state x to the system response y(r) e R*. We interpret (17.1) as 
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description of a subcircuit. Thus the input u and the output y are terminal voltages 
and terminal currents that are injected and extracted linearly. 

The dimension n of the unknown vector x{t) is of the order of the number of 
elements in the circuit, which can easily reach hundreds of millions. Therefore, 
one may solve the network equations (17.1) by means of computer algebra in an 
unreasonable amount of time only. 

Model order reduction (MOR) aims to replace the original model (17.1) by a 
system 

^ [q(z(f))] + j(zW) + Bu(r) = 0; y(f) = L^x(f), (17.2) 

with z(f) e M''; q, j : R'' -^ W and B e R''"" and L e R'''"^, which can compute a 
system response y(f) e R' that is sufficiently close to y(f) given the same input 
signal u(?), but in much less time. 

In the following we recall the fundamental problem that arises in nonlinear 
model order reduction. We overview different nonlinear model order reduction 
techniques and their field of application. For two of these methods, proper 
orthogonal decomposition (POD) [12, 14, 16] and trajectory piecewise lineariza- 
tion (TPWL) [15, 21] we will go more into details and report observations we 
made when applying them to different test examples. 



17.2 Linear Versus Nonlinear IVIodel Order Reduction 

So far most research effort was spent on developing and analysing MOR tech- 
niques suitable for linear problems. For an overview on these methods we refer to 

[1]. 

When transferring approaches from linear MOR, especially projection based 
methods, fundamental differences emerge. Considering a linear system 

C-x(f) + Gx(r)+Bu(f) = 0. with C,G e R"''", 
at (17.3) 

y(f)-L^x(f), 

usually the state x(f) is approximated in a lower dimensional space of dimension 
r ^ n, spanned by basis vectors which we subsume in V = (vi, . . ., v^) € R"^': 

x(f) « Vz(/), withz(f) e R". (17.4) 

The reduced state z is defined by a reduced dynamical system that arises from 
projecting (17.3) on a test space spanned by the columns of W. There, W and V 
are chosen biorthonormal, i.e., W^V = I,.xr. The Galerkin projection' yields 



' Most frequently V is constructed to be orthogonal, such that W = V can be chosen. 
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c|zW + Gz(f)+Bu(f) = 0, ^^^^^ 

y(f)=L^z(r), 

with C = W^CV, G ^ W^GV G W' and B = W^B e M''^™, L ^ V^L G Wp . 
The system matrices C,G,B,L of this reduced substitute model are of smaller 
dimension and constant, i.e., need to be computed only once. 

Applying the same technique directly to the nonlinear system means obtaining 
the reduced formulation (17.2) by defining q(z) = W"^q(Vz) and j(z) = W"^j(Vz). 
Clearly, q and j map from 'W to W . 

Multistep methods to solve (17.2) cause at each time point f; a nonlinear 
equation 

aq(z,)+^+j(z,)+Bu(f,)=0, (17.6) 

that defines Z/ which is the approximation of z(r/). In the above equation a is the 
integration coefficient of the method and fS ^W contains history from previous 
timesteps. Newton techniques that are used to solve (17.6), usually require an 
update of the system's Jacobian matrix in each iteration v: 



''^"^4-1 



w^ 



8q , 8j' 

ox ox 



V. (17.7) 

x('')=Vzi''' 



The evaluation of the reduced system, i.e., q and j, necessitates in each step the 
back projection of the argument z to its counterpart Vz followed by the evaluation 
of the functions q and j and the projection to the reduced space with W and V. 

Consequently, with respect to computation time no reduction will be obtained 
unless additional measures are taken or other strategies are pursued. 



17.3 Some Nonlinear MOR Techniques 

In MOR for linear systems especially methods based on Krylov subspaces [7] and 
balanced realization [13] are well understood and highly elaborated. In the fol- 
lowing, we shortly describe how these approaches are adapted to nonlinear 
problems and give references for further reading. 



17.3.1 Krylov Subspace Methods in Nonlinear MOR 

In linear MOR, Krylov subspace methods construct order reduced models of the 
system (17.3) such that the moments of the transfer functions of full and reduced 
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problem match up to a certain order. The transfer function H : C -^ C is defined 
by the Unear equation H(,s) = L^{sC + G)"'B. 

As there is no direct counterpart of the transfer function for the nonUnear 
problem (17.1), Krylov techniques are applied to systems derived from the general 
problem: 

bilinear system: —x{t) + Ax{t) + Nx(f)u(f) + Bu(f) = 0, (17.8a) 

dt 

LPTV system: - [C(f)x(f)] + G(f)x(f) + Bu(f) = 0. (17.8b) 

dt 

The problem (17.8a) arises from expanding x{t) + f (x(r)) + Bu(f) = around 
an equilibrium point. Linear periodically time varying systems of type (17.8b), 
with matrices C(f), G(f) that are periodic with period T, one gets when linearizing 
the system (17.1) around a periodic steady state solution with x"(f + T) = x"(f). 

For bilinear systems, Volterra- series expansion and multimoment expansions are 
the key to apply Krylov subspace based MOR. For further reading we refer to [6]. 

For LPTV systems, a time varying system function H{s, t) can be defined, 
where .s e C corresponds to the frequency of the input and t e [0, T) is used to 
capture the system's variation. The transfer function H is defined by a differential 
equation. Choosing an expansion point s, H(i, t) can be determined using time- or 
frequency samples on [0, T). We refer to [9] and the references therein. 

The techniques mentioned do not directly work on the nonlinear problem (17.1) 
of dimension n. The bilinear system that is reduced actually has a dimension of 
n + rp- + it' + ■ ■ ■, depending on the order of expansion. Similar, the system in the 
LPTV case has dimension p ■ n with p being the number of time- or frequency- 
samples used to approximate H(s, t). 



17.3.2 Balanced Truncation in Nonlinear MOR 

The energy Le(xo) that is needed to drive a system to a given state xo € M" and the 
energy Lo(xo) the system provides to observe the state xq it is in, are the main 
terms in balanced truncation. A system is called balanced if states that are hard to 
reach are also hard to observe and vice versa, i.e., Lc(x) large implies L„{x) small. 
Truncation, i.e., reduced order modeling is then done by eliminating these states. 
For linear problems Lc and L,, are connected directly to the reachability and 
observability Gramians P and Q that are computed from Lyapunov equations, 
involving the system matrices of the linear system (17.3). Balancing is reached by 
transforming the state space such that P and Q are simultaneously diagonalised. 
The entries ci, . . ., (T„ on the diagonal are called Hankel singular values. From the 
basis that arises from the transformation only those basis vectors that correspond to 
large Hankel singular values are kept. The main advantage of this approach is that 
there exists an a priori computable error bound for the truncated system. 
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In transferring balanced truncation to nonlinear problems, three main tracks can 
be recognized. Energy consideration is tiie common ground for the three 
directions. 

In [8] the energy functions arise from solving Hamilton- Jacobi differential 
equations. Similar to the linear case, a, now state dependent, state-space trans- 
formation is derived such that Lc and Lo are formulated as quadratic form with 
diagonal matrix. The magnitude of the entries, the singular value functions, are 
then basis to truncation again. As the Hamilton- Jacobi system has to be solved and 
the varying state-space transformations have to be computed, it is an open issue, 
how the theory could be applied in a computer environment. 

In sliding interval balancing [20], the nonlinear problem is first linearized 
around a nominal trajectory, giving a linear time varying system like (17.8b). At 
each state finite time reachability and observability Gramians are defined and 
approximated by truncated Taylor series expansion. Analytic calculations, basi- 
cally the series expansions, connect the local balancing transformation smoothly. 
This necessary step is the limiting factor for this approach in circuit simulation. 

Finally, balancing is also applied to bilinear systems (17.8b). Here the key tools 
are so called algebraic Gramians arising from generalised Lypunov equations. 
However, the algebraic Gramians can serve only to get bounds for Lc and Lg. 

For further details we refer to [3, 5] and the references therein. 



17.4 TPWL and POD 

In view of high dimensional problems in circuit simulation and feasability in a 
computational environment, trajectory piecewise linearization (TPWL) [15] and 
proper orthogonal decomposition (POD) [14] are the most promising approaches 
for the time being. 



17.4.1 Trajectory Piecewise Linearization 

The idea of TPWL [15] is to reproduce the typical behaviour of the full nonlinear 
system (17.1) by a varying combination of a set of order reduced linear models. 
For this purpose a training input u(f) for t e [fo,4] is chosen that drives the 
system into typical states, i.e., situations. A transient simulation with the chosen 
input yields a collection of points xq, ■ ■ ■^'^N, approximating the trajectory x(f,) at 
timepoints fo<?i < • • • <?a' = ?c- On the trajectory, points {xj'", . . .,x''"} C 
{xq, . . .jX^f} are chosen around which the nonlinear functions q and j are linear- 
ized. The linear models, that are all of dimension n, are reduced individually. This 
delivers local reduced subspaces. A common subspace of dimension r -^ n, which 
is represented by the columns of V G M"^'^, is constructed that describes the pri- 
mary information of all local subspaces. All linear models are projected on this 
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space. Finally a mechanism, a weighting w,(Vz) G [0, 1] for / = 1, . . .,5 is intro- 
duced to decide which linear submodel is valid in a certain situation. And the full 
system is replaced by 



£w,-(Vz 



i=0 



Y^CCVjZ + V^G,-Vz + V^(j(x;.'") - C-xf") 



+ V^Bu(0 ^ 0, 

(17.9) 



that determines z G M' such that Vz « x. Here C, — gj 



andG,=| 




The formulation (17.9) is taken from [21]. We find a slightly different setup in [15]. 

Selection of linearization points: The first crucial point in TPWL is to decide, 
when to add a linear substitute for the nonlinear problem automatically during the 
training simulation. Too many linear models could make the final model slow, too 
few could make it inaccurate. 

In literature we find two different strategies to decide upon adding a new model: 

• In [15] the author suggests to check at each accepted timepoint t during sim- 
ulation for the relative distance of the current state x to all yet existing i line- 
.,x|'". If the minimum exceeds the bound ^ > 0, i.e.. 



(17.10) 



X becomes the (/ + l)st linearization point. Accordingly, a new linear model, 
arising from linearizing around x is added to the collection. 
• In [21] the mismatch of nonlinear and linear system motivates the creation of a 
new linearization point and an additional linear model: during training both the 
nonlinear and the lastly constructed linear system are computed in parallel. If 
the difference of the two approximations to the true solution at a timepoint t„ 
becomes too large, a new linear model is created from linearizing the nonlinear 
system around the state the system was in at the previous timepoint f„_i. 

Reducing the linear models: Basically, any linear MOR technique can be applied 
to the s linear systems. In [15] e.g., a Krylov-subspace method is used. However, the 
effect of reducing linear submodels on the overall accuracy in time domain simu- 
lation and computational work is an open question. Also, the additional source terms 
j(x|."') — G,xJ.'"in(17.9)and observations from experiments(see Sect. 17.5) give rise 
to the question, if and how smoothness of the solution can be guaranteed. 

Determination of the weights: After having done the training, a set of linear, 
reduced, models are available to replace the nonlinear problem. In (17.9) this is 
done by a convex sum of the models at hand. To have minimum complexity one 
aims at having to deal with a combination of just a small number of linear 
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submodels. Hence, the weighting function is chosen such that only a few (in the 
ideal case just one) dominant linear model is chosen: 

w,-(x) =e""ll''"''i'°ll, fori^l,...,s,withm^ min ||x-x;'"||. (17.11) 



The constant fi adjusts how abrupt the change of models is. A typical value is 
/^ = 25. To guarantee a convex combination, the weights are normalized such that 

E,-H'/(X) = 1. 



17.4.2 Proper Orthogonal Decomposition and Adaptions 

Similar to TPWL, a training input u is chosen and a transient simulation is run. 
During this numerical benchmark integration of the nonlinear problem (17.1) on 
the training interval [fo, t^] K snapshots of the state x(^) are collected in a snapshot 
matrix 

X-(xi,...,x^) eR'"^'^. (17.12) 

The snapshots, i.e., the columns of X, span a space of dimension k<K. We search 
for an orthonormal basis {vi , . . ., vj} of this space that is optimal in the sense that 
the time-averaged error that is made when the snapshots are expanded in the space 
spanned by just r<k basis vectors to x,..,, 

1 '^ 

(||x — Xj-IIj) with the averaging operator (f)=— N^f; (17.13) 

1=1 

is minimized. This least squares problem is solved by computing the eigenvalue 
decomposition of the state covariance matrix |:XX^ or, equivalently by the sin- 
gular value decomposition (SVD) of the snapshot matrix 



ffi 



X^UST with UeK"^",TGM'^^'^ and S 



(17.14) 

where U and T are orthogonal and the singular values satisfy ffi > cr2 > • • • cr,, > 0. 
Note, that we assumed K > n in (17.14). 

The matrix V G R"^*^ whose columns span the reduced subspace is now build 
from the first r columns of U, where the truncation r is chosen based on energy 
considerations [14]. 
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For V, it holds V^V = Irxr- Therefore, Galerkin projection as described above 
can be applied to create a reduced system (17.2). However, as mentioned in 
Sect. 17.2 the cost for evaluating the nonlinear functions q, j is not reduced. 

Missing point estimation (MPE) [2], adapted POD [19], the reduced-basis 
method [10] and Discrete Empirical Interpolation Method (DEIM) [4] offer 
techniques to overcome this problem. Basically, besides constructing the r 
dimensional subspace of W spanned by the columns of V, the number n of rows is 
reduced, i.e., V G M"'"' is replaced by P[V = V =G W^'' with a selector matrix 
Pi; e {1,0}"^^. In this way, the «-dimensional function q (similar j) is replaced by 
some Pj'q = q : R" -^ R«. 



17.5 Numerical Examples 

For testing purposes, a time-simulator, has been implemented in octave. The 
underlying DAE integration scheme used here is CHORAL [11], a Rosenbrock- 
Wanner type of method, adapted to circuitry problems." 

Our main questions for investigating MOR techniques for nonlinear problems, 
intending to use these approaches on an industrial scale, are: 

• How sensitive is the reduction w.r.t. heuristics? 

• How sensitive are reduced order models w.r.t. to changing input signals? 

• Are there classes of circuits where the reduction does not work? 

We stress that we do not intend to let the methods appear in bad light but rather 
put our results to discussion in order to find explanations and improvements. The 
authors are open for discussion on the presented work. 



17.5.1 Tes teases 

The nonlinear transmission line in Fig. 17.1 is taken from [15]. It has dimension 
100 and contains nonlinear diodes whose currents are modelled by 
id{v) ~ exp(40 • v) — 1, where v is the voltage drop between the diodes' terminals. 
The inverter chain depicted in Fig. 17.2 is taken from [21]. It has dimension 104 
with a transistor whose drain-source current is modelled by ids = 2 • 10^^- 
max{ug — Us — 1.0,0) — max{ug — uj — 1.0,0) , where Ug,Ud,Us denote the 
gate-, drain- and source-voltage, respectively. 



Similar results are obtained from a matlab-implementation using odelSs as integration scheme. 
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Fig. 17.1 Nonlinear transmission line 



Fig. 17.2 Inverter chain 
[21] 




1 



17.5.2 Testruns 



Nonlinear transmission line: As in [15] for extracting a model a shifted Heavi- 
side function was used as training input. Resimulation is done both with the 
training input and with a cos-function on the interval [?startj fend] = [0, 10]: 



"train (?) ^ H{t - 3) = 



t<3 

1 t>3' 



.w4(i 



cosl-r 



The TPWL-model was extracted with the Arnoldi-method as suggested in [15], 
leading to a order reduced model of dimension 10. For choosing linearization 
points, the strategy (17.10) with 5 ~ 0.00167 has been tested. With this setting, 20 
linear models are constructed. Also the extended strategy described in [21] is 
implemented, but does not show much different results for the transmission line. 
A more detailed discussion on the model extraction and statistics on which models 
are chosen can be found in [18]. 

Concerning the first two questions posed in the beginning of this section, i.e., 
the influence of heuristics and the impact of changing input signals, we turn to the 
graphs shown in Fig. 17.3. On the left part we see what happens when we feed the 
TPWL-model (17.9) with the 20 models just created and weights being determined 
according to (17.1 1) applying both the maximum norm and the Euclidean norm. In 
the top graph we see that in neither case the trajectory of the voltage at node 1 can 
follow the true solution correctly. The kinks, evident especially when taking the 
2-norm for measuring the distance, suggest that there are some difficulties with 
choosing the weight, i.e., switching the responsible substitute model. Indeed, as the 
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Fig. 17.3 Transmission line: different settings — trajectories (top) and weights {bottom) 

lower picture shows, the importance of models is fluttering and unlike predicted 
we have to deal with several models of equivalent importance in (17.9). 

The right column of Fig. 17.3 shows trajectories that are closer to the reference 
solution. This improvement was achieved by letting the TPWL model (17.9) 
choose from not the full set of 20 submodels but only three of them. However, this 
tunings was done manually, i.e., testing different settings. Using different models 
or increasing the number of models to choose from, steers the system again to the 
situation depicted on the left of Fig. 17.3. We note that by changing b in (17.10), a 
smaller number of linear submodels is constructed but these were not able to 
represent the full problem sufficiently accurate. 

By the observations made, TPWL appears to be very sensitive to heuristics and 
implementation details. 

Also a POD model as well as a POD model that has been modified with the 
DEIM algorithm [4] has been extracted. In both cases the state-space was reduced 
to dimension 4. Applying the DEIM algorithm q and j of dimension 100 where 
replaced by q, j of dimension 5. We abandon to show a corresponding graph for 
the POD/POD-DEIM case as for node 1 the trajectories cannot be distinguished by 
inspection. 

Table 17.1 gathers the performance of the models, measured in time con- 
sumption and L2-norm of the error of the reconstruction of the node voltage at 
node 1 and, exemplary, at node 50. As predicted in Sect. 17.2, the plain POD- 
projection creates a reduced model that is even more expensive than the full 
system. The DEIM adapted POD model is superior to TPWL as no decision about 



' The models used arise from linearization around the states the system was in during training at 
f e {0.0,3.01943,3.04244}. 
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Table 17.1 Transmission line: performance of nonlinear MOR techniques 

Time for simulation L2-norm of error reconstructing 

with cos input [s] node 1 [10"*] node 50 [10"'' 



Full problem 4.16 

TPWL all models, 2- (inf-) norm 3.79 (3.23) 

TPWL preselected, 2- (inf-) norm 2.88 (3.28) 

POD model 4.34 

POD-DEIM model 1.47 



26.50(10.72) 

10.86(7.86) 

3.68 

3.76 



0.76(0.29) 
0.44 (0.47) 
9.18 
9.82 



which model to use has to be made. Concerning the errors, the third column 
reflects the observations from Fig. 17.3. However, being interested in a different 
output, e.g., the voltage at node 50, the rating of the approaches can change and the 
TPWL- are superior to the POD-models. Note, that here the output, i.e., L in 
(17.1), is not taken into account in reduction, neither for TPWL nor POD. 

Inverter chain: As suggested in [21], a piecewise linear voltage source 

C/in = M(r)withM(0) = 0, M(5ns) = 0,M(10ns) = 5,M(15ns) ^ 5,M(17ns) = 0. 

was used for training purposes. Resimulation is done with both the training signal 
and a signal where the input impulse repeats after each 30 ns. From Fig. 17.4 it is 
obvious that both TPWL- and POD-model can not accurately reproduce the 
behaviour of the nonlinear system. The trajectories produced by both models — 
TPWL reduced to dimension 50, POD to dimension 28 — wiggle notably. And the 
TPWL-model even misses every other impulse running through the system. 

The inadequacy of the presented MOR techniques for the inverter chain 
becomes also obvious from the performance of the approaches in comparison to 
the work necessary for treating the full problem as displayed in Table 17.2. 



Fig. 17.4 Inverter chain: 
repeated pulse, inverter 6 




60 80 
time [ns] 



Table 17.2 Inverter chain: 
perfomiance of nonlinear 
MOR techniques 



Repeated impulse Accepted (rejected) timesteps 



Full problem 677.5 s 
TPWL model 510.8 s 
POD model 829.4 s 



1074(182) 
1437(824) 
1276(104) 
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The reason for this behaviour might be found in the class of system the inverter 
chain belongs to. Here, a signal passes through the system activating at each time 
point a different set of elements, i.e., states and leaving the others untouched until 
they are activated. Hence, we are concerned rather with a redundancy in time than 
one in space. Here multirate methods (e.g., [17]) might be used to accelerate 
computation. 



17.6 Discussion and Outlook 

Although a variety of theoretically appealing MOR techniques for nonlinear 
problems exists, TPWL and POD seem to be the most practicable approaches. 
However, as our investigations show, care has to be taken when these techniques 
are being applied, as the approaches seem to be very sensitive to various heuristics. 
Furthermore, as seen with the inverter chain there are circuit problems where 
model order reduction is not viable. For TPWL there seems to be space for 
improvement in selecting linearization points and combining the submodels. Here 
it could be especially interesting to investigate the impact of the sharp, non- 
smooth, switching of the submodels on the smoothness of the final solution. The 
POD adaptions, originating from MOR for PDEs, deserve closer attention to adapt 
the ideas suitable for specially structured right-hand-sides to the rather compli- 
cated mixed structures in circuit problems. 
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Chapter 18 

An Approach to Nonlinear Balancing 

and MOR 



Erik I. Verriest 



Abstract For linear systems we recall the notion of sliding interval balancing 
(SIB). It is shown that subsystems conserve stability. The truncated SIB realization 
bounds the optimal approximation in an input-output sense. Nonlinear balancing is 
approached by applying SIB on the linear variational system. 

Keywords Balanced realization • Gramians • Lyapunov equation • Model reduction 



18.1 Static Versus Dynamic Approximation 

The problem considered in this chapter is the reduction of a system represented in 
state space form by i = /(x, u) and output y — h{x) by a system of lower dimension, 
say S, —f{^,u), and output given by h{^). We get some queues from the approxi- 
mation of paramere Wzecf ipace cMrve^, where each fixed tu e Q, </»(•, (m) : [0, 1] ^ R" 
corresponds to a curve in R" with line coordinate t. If one considers all curves in Q, 
according to some measure fx, the problem amounts to finding a submanifold M of 
suitable dimension p < n such that (f>{-,-) G M while minimizing 
1 

/ ||(/)(f,ffl) -(/)(f,ffl)|pd/i(ffl)dt. (18.1) 

men 

If M is a subspace, this is known as the Galerkin projection method. For linear 
dynamical systems, it has been proposed to approximate the response of the state 
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to an impulse input in the above sense. Clearly, this neglects any state to output 
properties the system may have. See for instance [1, 6]. In order to obtain a better 
balance between the input-to-state and state-to-output properties of a system, the 
notion of a balanced realization was introduced by Moore [3]. 

18.2 Gramians for Linear Systems and Applications 

In this section we first derive some structural properties for analytic time varying 
systems, and prove a stability conservation property. These properties will be 
utilized in what follows to derive structural properties of smooth nonlinear systems 
via the linear perturb ational system along a nominal trajectory, as this perturba- 
tional system is time varying in general. 



18.2.1 Analytic Time Varying Systems 

Let (5 > be given, and consider a linear time varying system (A(t), B(t), C(t)). The 
finite time sliding interval reachability and observability Gramians are defined by 

M{t,5)^ f (t>{t,x)B{x)B'{T)0'{t,r)dr. (18.2) 

t-d 

t+s 
&{t,S)= I <D'(T,f)C'(T)C(T)<D(T,f)dT. (18.3) 

t 

where <!>(?, t) is the transition matrix, satisfying for all t and T,|yO(f, t) = 
A(f)<l)(f, t), 0(t, t) = /. The Gramians respectively satisfy the time variant Lyapu- 
nov equations (where x+ — t + 5 and x_ — t — 5) 

k{t, 8) ^A{t)PA{t, 8) + PA{t, 8)A'{t) + B{t)B'{t) ~ <D(/, T_)fi(T_)5'(T_)(D'(/, t_) 
-&{t, 8) =A'(f)ff(f, 8) + (9{t, 8)A{t) + C'{t)C{t) - <D'(t+, r)C'(T+)C(T+)(D(T+, f). 

For analytic systems, we have the convergent Taylor series 

<l.(r, t)5(t) = ^i^ [(A - D)'B], (18.4) 



<l.'(T,r)C'(T) = g (L^ [(A' + D)'C'], (18.5) 



where D is the differentiation operator (acting to the right). 
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Defining local reachability and observability matrices: 

Roo(f) = [B, (A - D)B, (A - JifB, . . ], (18.6) 

O'^(r) = [C, (A' + D)C', (A' + D)^C', . . .], (18.7) 

the Gramians can be computed via termwise integration, and by convergence of 
the Taylor series (for analytic systems) approximated by their truncation: 

^{t -5,t)^ R^(0A„(<5)R'^(f) = R„(r)A,„(,5)R:,(f) + 0(^"+') (18.8) 

G{t,t+5)^0'^{t)^p{5)OUt)^0'„{t)^p{mn{t) + 0{5"+') (18.9) 

where Am() is a block matrix, with yth block 

[^"■(^)]^^ (,^l)!a'^l^)!U;-l) ^- ^''-''^ 

These Gramians are weighting matrices for energies and uncertainties [7]. 



18.2.2 Sliding Interval Balancing 

In order to obtain a uniform measure for the relative importance of subspaces 
(w.r.t. input and output), a state space transformation T{-) is found such that the 
new input to state and state to output maps are symmetric, i.e., -Si, ~ (D, and both 
are diagonal. The rationale behind this is that in balanced coordinates, for each 
coordinate direction (more general, for each subspace) the degrees of reachability 
and observability are equal. Using the fact that the Gramians transform under the 
similarity T{-) as 

0lt ^ T{t):^,T{ty (18.11) 

0,^T{ty^(9,T{ty'^ (18.12) 

it was shown that, for analytic systems, a diffeomorphic transformation r(-), 
pointwise defined, exists that brings the original time varying system to the balanced 
form. The resulting realization is called sliding interval balanced (SIB) and the 
common Gramian, A(f) = diag[li(f), . . .,/l„(r)], with /l,(?) > /l,+i(r), the canoni- 
cal Gramian. Note that the A^it) are the eigenvalues of the product C?,.^,. Because of 
the requisite ordering, this can only be described piecewise if umbilic crossover 
points exist. For details, see [8, 13]. 

When balancing was first introduced by Moore (1980), only LTl realizations 
were considered, and balancing in the sense of Moore corresponds to SIB for 
^ ^ oo, asymptotic stability being implied. A balanced realization is obtained 
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(it is not unique) by determining a T^ai which simultaneously diagonalizes and 
equalizes the Gramian matrices which satisfy the algebraic Lyapunov equations 



A!m 



■GA^ 



-BB' = 
C'C = 0. 



(18.13) 
(18.14) 



18.2.3 Properties of SIB(5) for LTI Systems 

It was shown in [11] that for siso systems, the SIB((5) realizations are sign sym- 
metric, i.e. for some signature matrix, S, we have A' — SAS, Sb = c' . Conversely, 
any sign symmetric realization is SIB. Likewise, any SIB is M-balanced, which 
implies also a time weighted measure for energies and/or uncertainties. A system 
with reachability and observability matrix, respectively R and O, is M-balanced, 
where M — M' > if the M-Gramian matrices Mm — RMR' and C = O'MO are 
equal and diagonal [11]. 

Once a balanced realization is determined, a reduced order model is determined 
by projection of dynamics (POD). This method is known as balanced truncation. 



All An 

All A22 
[Ci C2] 





Bi' 


\ I 

\ POD 


All 


0' 




'B{ 




.B2_ 


H 














[Ci 







All 
Ci 



fil 



We prove here a new property, which generalizes the stability result of Moore. 

Theorem 1 All truncated SIB-realization of an asymptotically stable, minimal 
system (A, B, C) are asymptotically stable for sufficiently large 8. 



Proof In balanced coordinates, the canonical Gramian satisfies 



AA + AA' + BB' - d^^BB'd^ ^ = 



M 



A'A + AA + C'C - d^'^C'Ce' 
Adding, gives for Aj, the symmetric part of A, 

AjA + AAj + Q{8) = 



(18.15) 
(18.16) 

(18.17) 



where Q{5) = \ {(BB' + C'C) - e^^ BB' e.^' ^ - e.^' ^ C Ce.^^] . By asymptotic stability 
of A, there exists 8q>Q, such that for all 8 > 8q, ||e'^'^||<l. Consequently, 
Q{S) > 0, a property inherited by any principal submatrix of Q{S). In particular, if 
{Ar,Br, Cr) is a balanced truncation, and A, and Q,^ are the consistent truncations 
of A and Q{8), we get 



(A,).,A, + A,(A,), + Q,.i8) = 0. 



(18.18) 
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Minimality implies A > 0. By the strict positive definiteness of A^, the solution of 
this Lyapunov equation for A„ can be represented in the form 



(A,), = - / e-'^''Q,(^)e-'^''dt. (18.19) 



Since the real parts of the eigenvalues of A^ are bounded by the minimal and 
maximal eigenvalues of its symmetric part (A,.)^, stability follows. D 

Remarks 

1 . The balanced truncation of an SIB((5) realization is in general not balanced (SIB 
or otherwise) unless S = co. This is analogous to the discrete LTI case. 

2. Equation 18.17 and thus also (18.19) remain valid for time variant SIB real- 
izations, if Q is replaced by the appropriate Q{t, 5). Then the balanced trun- 
cation gives pointwise 

oc 

[AMs = -[ ^'"'^'^'Qrit, <5)e-'^'-W^ dT. (18.20) 



If for some d it holds that Q(t, S) > for all t, then the spectrum of A,.(f) lies in the 
open left half plane. It is known [15] that if the largest eigenvalue of [Ar(f)]^ 
satisfies 



lim / ;.„,ax(f) df = -oo (18.21) 

t^oo J 

to 

then the null solution of the SIB-truncated system is asymptotically stable, and 
uniformly asymptotically stable if property (18.21) holds uniformly with respect to 
to- If the above integral is merely bounded by M(to), the null solution is stable and 
uniformly stable if M does not depend on Iq. 



18.3 Metric Properties of Balanced Truncation 

In what sense is the balanced truncation close to the original system? In the LTI 
case, it is known that the H^ norm of the difference between the transfer functions 
is upper bounded by twice the sum of the neglected singular values [4]. In the time 
varying and general nonlinear case, this metric cannot be used. Willems [14] 
argued that a natural distance between two controllable LTI systems should be 
based on the distance between their L2 behaviors, which are distances between 
subspaces. A convenient choice is therefore the gap-metric. This inspired us to 
pose the approximation problem as an LQ problem in the state space. 
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18.3.1 A Natural Problem and a Simplification 

Assume a system is given in state space form by the triple (A, B, C). Let its initial 
condition be xq e M". The objective of model reduction is to model the input-output 
behavior of this system by one with a reduced state space, say of dimension r. 
This means that we postulate a form 

i^A,.x + B,.u (18.22) 

y = C,.x. (18.23) 

with dim x = r<n, and with m and y close to the given data (u, y) consistent with 
(A, B, C). In general {u,y) ^ (m,.v), as it is unlikely that the equations (18.23) are 
consistent with the given data. A reasonable objective function seems to be 

J= f \\u-uf + \\y~y\\-dt. (18.24) 

where S' is the entire interval where the data is available. The optimization is over 
all systems (A^, B,-, C^), adjusted inputs m, and initial state xq. The latter completely 
determine then y. Note that / depends on the given data u and the initial condition 
Xq, the output y being determined by these. The above sketched problem has only a 
unique solution up to similarity: If (A^jBri C^, m,-^o) is a solution then so is 
{TA,.T-\TBr, CrT-\u, Txo) for any T € G4(M). 

In principle, this problem can be solved in two steps: First fix Z^ = {Ar,Br, Cr) 
and optimize J over u and xq, resulting in an optimal performance /(Sr). Next, 
minimize over all S^. This may further be extended: y(Xr|M,xo) may be averaged 
over several input and initial state combinations, or even a continuum of them. 
Then optimize the average of these optima, /avg(Sr)> over all S^. The first step is a 
typical LQ problem, and its solution requires the solution of a Riccati equation, 
which we would like to avoid. Therefore we simplify the performance index and 
setup of the problem. Instead of simultaneously trading off the adjustment of u and 
y, let's stagger the penalties. 

What to use as training data (m, 3;)? Of course this may be a mote point if there 
is no control over the system, as for instance when only causal relations are 
analyzed without being able to influence. If the input variables u can be freely 
chosen, then let it be the input of minimal energy (<»„ = /||m|| df) which sets 
up the state xq from the zero state, say in subinterval .^_, and which is zero 
after that in subinterval ^+, with ,^_ U .T+ = .T. Hence ^^ = lr\\u - u\\^ dt ^ 

The output energy, if,. = J \\y\\ dt is only measured after that, again for reasons 
of mathematical simplicity. Furthermore if m = 0, we shall insist that then also u 
must be zero. Letting ^_ = (—(5,0) and ^+ — (0, 8) we arrive at 
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u^u\\^dt+ / Wy^yW'-dt. (18.25) 



Note that the input u is parameterized by the state xq set up at time zero. In turn, 
this state determines the output y completely, so that the optimal J in (18.25) is 
also parameterized by xq. Because of the LQ nature of the problem, / scales 
quadratically in the norm ||xo||. Unless special weights in the state space are 
intended, it suffices to let xq live on the unit ball in M". For 5 -^ oo, the problem 
touches on the optimal Hanliel norm reduction [2]. 



18.3.2 Solution 

Given E, and xq, the input u which reaches xq from the zero state in time S is given 
by 

u{t) = B'e-^''gi-^xo (18.26) 

where i% is the reachability Gramian over (— 8, 0). Let the reduced system E,- be 
steered by input u, and set up a state xq (also from its zero state). It follows then 
that the second integral in (18.25) is expressible as a quadratic form in xq 

s s 

^(xo) = J\\y~ yf df = y llCe^'xo - Cre^'%f dt. 



Hence, 



/= / \\u-ufdt + ^'{xo). (18.27) 

It is further known that the specific training input u has energy and if „ = x'qM^^xq, 
and that likewise (f ,, — x^qOxq. The reduced LQ optimization problem is then 
restated as the minimization of 



\\u - u\\^ dt + x'f^Gxo -Ix'^OsXa+x'^Oxo, (18.28) 

-<5 
where &s is the mixed observability Gramian defined by 

s 
(9,= I t^''C'Cr&'^''dt. (18.29) 



It satisfies the Sylvester equation 
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A'C?, + GsAr + C'Cr - e^'^C'Cd^''^ = 0. (18.30) 

The following theorem provides a bound to the above problem solution in terms 
of the SIB(^) realization. 



Theorem 2 If the SIB(S) canonical Gramian is partitioned as 



with 



Ai 

IIA2II/IIA1II = £ and e < 1, then the balanced truncation will yield a mismatch 
J — 0(e) for sufficiently large 6. 



Proof The problem of minimizing (18.28) is a standard LQ problem, with 
Hamiltonian 

H=\\u-il\\^ + A!{ArX + BrU) (18.31) 

and terminal manifold 

^ = jt^Q^xo - 2^(9^X0 + x'f^GxQ. (18.32) 

It follows from the optimality conditions and Euler-Lagrange equations that 

(/ + Md)xQ = {m,3^r^ + ^G',)xo. (18.33) 

Substituting in 

M - M = B[e-'^'''[dio - &[xq]. (18.34) 

"squaring up" and integrating, we get after some manipulations J{xq) as a qua- 
dratic form with weight matrix 

^-' [S^^d - 0i(9s][I+ ^&r^^[I+&^r^[&^s - G\3k\m-^ + G 



' &s3i][I + G SiY 3i-^[I + ^GY'll^^g^-^ + i^G'^] 



(18.35) 



where we introduced the mixed reachability Gramian 

d 
m,= /"e'*''fi,B'e'^''df. (18.36) 



If the system Z is in SIB((5)-balanced form with canonical Gramian A parti- 
tioned as given, then take for Er its SIB(^)-balanced truncation. As there is no 
guarantee for its optimality, we shall use this reduced model to establish an upper 
bound on /. The truncation of an SIB((5) realization is not balanced. However, if E 
is asymptotically stable, it follows from Theorem 1 that the chosen Z, is 
asymptotically stable for sufficiently large (5. Let then (5 be such that ||e'^'*|| <^z. 
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and thus, using the relations (18.15) and (18.16), the ^-reachability Gramian PAr of 
Sr satisfies the order bound 

A,(Ai - .#,) + (Ai ~ .^,)a; = 0{^). (18.37) 

It follows that Ai — M,- = 0(e), and similarly for the observability Gramians. 
A similar argument then establishes the £ closeness of (9^ and Ms respectively to 
[Ai,0] and [Ai,0]. Substituting these approximations in (18.35) shows that this 

"0 

A2 

order e. D 



non-optimal 7 is a term of order e away from xj. 



xq, which is itself of 



Remark The same argument as used in the proof shows that in fact, the 
(5-Gramians are order s close to the infinite time Gramians. Hence, for stable A, one 
could work with realizations balanced in the sense of Moore. Such a realization 
will then approximately be SIB((5) balanced as well if S is sufficiently large. The 
benefit of the SIB-realizations is that they may be computed via direct integration, 
and they also shed light on the time varying and nonlinear extensions. Theorem 2 
motivates the use of the balanced truncation as the reduced order model. 



18.4 Nonlinear Model Reduction 

The discussion in the previous sections set the stage for an extension of the 
balanced truncation for nonlinear system. In order to avoid any pathological cases, 
the class of systems is restricted to analytic system, and for simplicity, we limit 
this further to siso systems. Let thus the system 

x^f{x,u) (18.38) 

y^h{x) (18.39) 

be given, with/ and h analytic. This implies that convergent Taylor series exist. 
Let further a class of "typical" initial states be given, say some domain V GX. 
The rationale behind this is that if V is not too large, one could focus attention on 
some nominal input, Wnom, say defined with respect to the system with initial 
condition xq, an average over V. Deviations form the nominal input are assumed 
"small". This implies that the system can be "recentered" , by incorporating this 
Mnom directly in the model to yield 

i=/(x, Mnom+ m)=/W +gWM (18.40) 

y^h{x) (18.41) 

Consider the linear variational system in the tangent space moving with the 
nominal state along the nominal trajectory. In [7], and references therein, it was 
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shown that the instantaneous reachability and observability matrices can be 
defined associated with system (18.41), with the use of the gradient, d, Lie 
derivative, Lf, and Lie product [/, g] = gf/ — g^^ operations, with ad^+'g = 
[/■,ad^g], and ad)!g = g. 

Remarks 

1. If the system has strict relative degree n at xq, i.e., (see for instance [5]), 

LgL'jh{x) = 0, Va; e U{xq), / = 0, . . ., m - 2 

LgL}-'h{xQ) ^ 0, 

then g, adfg, . . ., ad^^'g are linearly independent at xq. For linear systems the 
relative degree is the pole zero excess. 

2. A system may fail to have relative degree at some points. 



Definition 1 The nonlinear local reachability and observability matrices are 
respectively 



Roc W = [g, ad^fg, ..., ad"_/g, . . .]^ 
OU^) = [{dh)', {dLfh)',..., {dL''-'h)\ 



(18.42) 
(18.43) 



Denote their truncated (n-component) versions by 

dh 

dLfh 



0„(f,h) 



dLj-^h 



Definition 2 The local reachability and observability Gramians are respectively 

s 

Ms{x) = /(e"''-/g),(e"'»-/g)> (18.44) 



O 

^<5W = I {d&'^h)l{de'^fh)^dx. 



(18.45) 



Motivation for these definitions stems from the linear variational system, which 
has time varying reachability matrices and Gramians. When mapped back to the 
original nonlinear model, they have the form given in the definitions. The 
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exponential of the operators appearing in the Gramians is then defined via the 
series expansion. The Gramians (18.44, 18.45) satisfy Lyapunov-Uke equations 
(locally at.c) [7]: 

18.4.1 Local Balancing 

Locally, at X ^ Y, the balancing transformation T(x) can be found from the Gra- 
mians {,^s{x), <5^(x)). The elements of the canonical Gramian A{x), now defined 
at each state x, are the positive square roots of the eigenvalues of the product 
(9s{x)^s{x)- If the system has strict relative degree n, the computation can be 
simplified by taking the first n terms in the series expansions and expressing 
Gramians as weighted "squares" of the reachability and observability matrices as 
suggested in (18.9). 

Assume the system starts from rest (xgq = 0), and we want to analyze its input- 
output behavior. For small u, let U be the union of the reachable sets for all t < tf, 
with tf sufficiently small so that the entire space is not covered. U is some tubular 
neighborhood of the nominal trajectory. Let, for each x e t/, the corresponding 
canonical Gramian and local balancing transformation be known. In a patch where 
the dominant part of A has dimension r, the space can be foliated into the span of 
the dominant directions, say X^. Hence U\oc ~ (spanZr + span/V(x)) n U, where 
N{x) is the (n — r)-dimensional space normal to the dominant subspace. By 
construction, different points (states) on a single leaf can be reached from each 
other, and can be discerned easily. However, it is hard to jump from leaf to leaf 
(move in the normal directions, or resolve states along the normal directions. In 
this sense, a single leaf suffices as approximation for the state space model, and 
thus the approximation of the input output behavior. Motivated by the linear case 
where the redundant subspace is collapsed, we want to postulate the existence of a 
Riemannian metric (f'A(x)^ on the state manifold [12]. 



18.4.2 Global Model Reduction 

A problem remains in that the reduction, given by a low dimensional manifold, is 
still embedded in the full space. Whereas the dominant sub manifold is clearly 
identifiable, there still is no intrinsic lower dimensional set of state equations to 
make the reduced order model global. 
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Let's approach the problem from another starting point: Let (^ be a diifeor- 

morphism in a patch of the (full) state manifold. Denote the inverse of ^{x) = ^ by 
x{c) — X. Then the system (18.41) is represented by 






^8x' 

m 



A 



"=/(c)+g(C> (18.46) 



hixiO)^m. (18.47) 



Locally, the diffeomorphism is determined by the Jacobian J{x) = i^. Of course, if 
we want the ^ coordinates to be balanced in some global sense, we would naturally 
require that J(x) corresponds to the local balancing transformation, T(x). This 
implies that ^(x) should be an integral of T(x). However, the system 

g=r(x) (18.48) 

is a Mayer-Lie type system of partial differential equations, which has no solution 
unless the Frobenius conditions hold 

^-^ = (18.49) 

OXk OXj 

In [9], a method was suggested based on the approximation of the local balancing 
transformation by an integrable Jacobian (Mayer-Lie interpolation). 

So, let us now assume that the system (18.47) is a good approximation of 
(18.41) in balanced coordinates. Following the philosophy in Sect. 18.3 project the 
state ^ to [(^,-, 0]' where dimc^,. ^ r<n, and discard the equations of the rate of 
change in the last n — r components of ^. The resulting set gives a global (actually 
only in a patch) reduced approximation of dimension r. 

<^,=/([c:,0]')+^([^',,0]> 4 F{Q + G{Qu (18.50) 

y^h{[CQ-^)^h{Q. (18.51) 



18.4.3 Comments 

As all computations of observability and reachability matrices and the computation 
of the local balancing transformation must be done symbolically, it cannot be 
expected that truly very large scale systems will easily be reduced unless the 
system have a specific simplifying structure. Such an example was explored in 
[10]. However, from the point of view of sensitivity analysis with respect to 
parameters in a nominal system, the method may turn out to be viable. 
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